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PROJECTIONS OF THREE TYPES OF 
FACTOR PATTERN 


CHESTER W. HARRIS * 
University of Wisconsin 


In transforming any given correlation matrix, 


R, into a primary factor solution, two steps are 


commonly taken. The first is to transform the 
n by n correlation matrix, of rank m, into an 
arbitrary orthogonal factor matrix of size n by 
m, such as the centroid matrix, F. The second 
is to rotate this factor matrix into the desired 
primary factor pattern. This second step ob- 
viously depends upon a concept of the desired 
primary factor pattern. An indefinite number 
of rotations might be made, and consequently 
one has no notion of when to cease this opera- 
tion unless an exact step-by-step specification 
of the number and character of the rotations to 
be made is set up beforehand, or a criterion 
for recognizing the primary solution is avail- 
able. An attempt is made in this paper to give 
a geometric formulation of the primary factor 
solution that provides such a criterion for 
certain types of desired factor pattern. One 
approach to the translation of this into an 
analytic criterion will then be suggested. 


The method to be used is simple in outline. 
A particular type of figure plus its analogues 
in higher-dimensional space will be postu- 
lated as the model for the primary solution. 
This figure is the tetrahedron, a three-dim- 
ensional figure formed by four triangles, 
which consequently has four apexes and six 
edges. The projections of this figure and of 
its analogues into a space of one less dimen- 
Sion will then be specified. Next, certain 
factor patterns will be specified as desired 
ones, and the projections of the centroids 
they generate will be shown to be consistent 
with the projections of certain regions of the 
postulated model. Since certain of these pro- 





jected regions are represented as points and 
lines, one avenue to the development of an 
analytic criterion may be opened up. 


The Model and its Projections 





An essential characteristic of the extended- 
vectors representation of a set of centroids 
is that it projects the test vectors into an 
(m - 1)-space.+ In order to relate the post- 
ulated model to the extended-vectors repre- 
sentation of certain types of factor pattern, 
it will therefore be necessary to develop both 
the m-dimensional model and its projection 
into (m - 1)-space. The tetrahedron has been 
described above. Specify one apex as the 
fixed origin to correspond with the factor prob- 
lem in which the test vectors have a common 
origin. Let the three planes that meet at this 
fixed origin each be defined by a large number 
of test vectors, but let the ‘‘interior’’ of the 
tetrahedron be hollow. If each test vector is 
now extended into the plane opposite the orig- 
in, these test vectors project as points that 
define three lines arranged in the form of a 
triangle. This is, of course, the familiar ex- 
tended-vectors representation of an ‘‘ideal’’ 
three-factor primary pattern.2 


Two principles are to be established. For 
the 3-space model (the tetrahedron) each of 
the three coordinate planes projects as a line, 
which is defined by the points representing 
test vectors that lie in that plane. The three 
lines have a special arrangement, since they 
form a triangle. In general, then, the projec- 
tion of the coordinate hyperplanes of an m- 
space figure will be required to yield the 





* I am deeply indebted to Professor J. R. Mayor, Department of Mathematics, Uni- 
versity of Wisconsin, for reading this paper in manuscript, and suggesting 


@ number of changes. 


Any errore that remain are solely mine. 


. See L. L. Thurstone, Multiple-Factor Analysis (Chicago: University of Chi- 


cago Press, 1947), Chapter 


2, See Thurstone, op. cit., 


» fOr a description of this representation. 


p. 228, for an illustration. 
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(m - 1)-space figure that constitutes a hyper- 
plane of the m-space figure. By beginning 
with the triangle as a hyperplane of the tetra- 
hedron, it is possible to define an indefinite 
series of projections of configurations that lie 
in higher-dimensional space. The triangle and 
the tetrahedron also give the principle for spec- 
ifying the number of apexes, bounding hyper- 
planes, and intersections of bounding hyper- 
planes for any such configuration. If m equals 
the space within which the figure lies, then 

m + 1 gives the number of apexes, m + 1 also 
gives the number of bounding hyperplanes, and 
(m + 1) (m) gives the number of intersections 


of these bounding hyperplanes, pair by pair. 


In order to define the 4-space hypertetra- 
hedron, make use of the two principles given 
above. First, its projection in 3-space will 
be a tetrahedron. Because we wish to use 
the extended-vectors representation in deal- 
ing with the factor problem, however, this 
projection in 3-space will be regarded as a 
skeleton or outline of a tetrahedron. The dis- 
tinctive characteristics of this skeleton will 
be given by six lines so arranged as to meet 
in four points in 3-space; these lines are, of 
course, defined by points that are the pro- 
jections of the test vectors. In contrast to its 
projection, the 4-space figure should have five 
apexes, five bounding hyperplanes, and ten 
intersections of these. Again assume that co- 
ordinate test vectors define the four coord- 
inate hyperplanes, i.e., those that meet at the 
special apex called the origin. The require- 
ment is that each of these hyperplanes pro- 
ject as a hyperplane of the next lowest dim- 
ensional figure, i.e., as a three-sided figure 
lying in a plane, or a triangle. The skeleton 
of the tetrahedron, which is the projection of 
the 4-space hypertetrahedron, has four such 
bounding planes; each one therefore represents 
the projection of a hyperplane of the 4-space 
figure. But such a hyperplane is by definition 
a tetrahedron in which only three planes are 
coordinate with the origin and defined by test 
vectors; therefore each such hyperplane should 
project as a triangle. Defining the 4-space 
hypertetrahedron as five tetrahedrons so ar- 
ranged as to meet in five points in 4-space is 
therefore consistent. Of these five tetrahedrons, 
only four are coordinate with the fixed origin 
and consequently are defined by test vectors. 
The fifth tetrahedron therefore is taken as the 
3-space model of the projection of the test 
vectors defining this 4-space figure, just as 
the triangle opposite the origin is taken as the 
2-space model of the projection of the test vec- 
tors defining a tetrahedron. Such a definition 





of the model and its projection ag be general- 
ized to a space of any dimensions. 


It is necessary to account for the six lines 
postulated in the projection of the 4-space hy- 
pertetrahedron in terms of planes within the 
4-space figure. In general, 


m+i=h+k 
The intersection, i, of a (m - 1)-space hyper- 
plane, h, and a plane k, in an m-space gives 
a line (1-space), thus: 


m+i=(m - 1) +2 
i=1 


and 


Similarly, any line, such as a test vector, in- 
tersects with such a hyperplane as a point. 

The test vectors therefore project into the 

(m - 1)-space as points. The test vectors de- 
fining coordinate hyperplanes project as points 
that define lines. These lines, therefore, rep- 
resent planes of the coordinate hyperplanes. 
This is obvious in the projection of the three- 
factor problem for which the tetrahedron is the 
model, since for it the coordinate hyperplanes 
are simply the three planes that project as 
lines. The six planes, each represented by a 
line in the projection, of the 4-space figure 
present a problem, however. Each of the four 
coordinate hyperplanes of the 4-space hyper- 
tetrahedron is conceived as a tetrahedron. 
Because of the restriction of a common origin 
for the test vectors, however, test vectors may 
lie in only three of the four bounding planes of 
any of these tetrahedrons. The expectation 
might therefore be that three times four, or 
twelve, such planes would project as lines. An 
additional restriction is necessary. It is that 
the four coordinate tetrahedrons of the 4-space 
figure must be arranged in 4-space in a manner 
analogous to the arrangement of the four planes 
of a tetrahedron in 3-space. Each of these 
four coordinate tetrahedrons of the 4-space fig- 
ure will therefore have a plane in common 

with every other such tetrahedron. This means 
that only one-half of these twelve planes, or 
six of them, are distinct in the arrangement of 
the four hyperplanes. 


It can be demonstrated that each apex of the 
4-space figure is required by these restrictions 
to be a point. This distinctive arrangement 
specifies that an apex is the common intersec- 
tion of m of the m + 1 hyperplanes. The inter- 
section of two hyperplanes, in m-space, gives 
a lower dimensional m - 2 hyperplane; of three 
hyperplanes, a lower dimensional m - 3 hyper- 
plane; and so on. Therefore the intersection 








3. Sommerville calls this model a simplex of m dimensions. 
would be, in his terminology, a simplex of m - 1 dimensions. 
Sommerville, An Introduction to the Geomet 


Methuen and Co., Ltd., ° pters 


Ite projection 
See D. M. Y. 


of NW Dimensions (London: 
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of m hyperplanes in an m-space gives a figure 
of m - m dimensions, or a point. Since each 
apex of the m-space is a point, each pair of 
apexes may be joined by a straight line. The 
requirement of coordinate test vectors, how- 
ever, restricts our interest to the m such 
lines that may join the special apex called the 
origin to each of the other m apexes. If these 
lines project into the {m - 1)-space, they will 
project as points. Now consider the projec- 
tion of the m-space figure. The apexes of the 
projection will number exactly m; each such 
apex is a point and therefore represents the 
projection of a line of the m-space figure. 
Further, each such apex of the projection is 
the common intersection of m - 1 of the hyper- 
planes of that projection. Since each hyper- 
plane of the projection represents a hyper- 
plane of the m-space figure, it follows that 





each of these m lines represents the intersec- 
tion in m-space of m - 1 hyperplanes of the 
m-space hypertetrahedron. The general 
theorem indicates that the intersection of 

m - 1 hyperplanes in an m-space defines a 
line. The apexes of the projection therefore 
represent certain special lines formed by 

the intersection of m - 1 of the coordinate 
hyperplanes of the m-space figure. The 
origin is a point on each of these m lines. 


With the definition of the 4-space figure 
and its projection, the definition of higher- 
space figures becomes a matter of translat- 
ing each one into the projection of the next 
higher one. A summary may be given as 
follows: 


The Hypertetrahedron 


Space in which Number of Number of co- Space in which 
the hypertetrahedron apexes and ordinate hyper- each coordinate 
lies of hyperplanes planes hyperplane lies 
3 4 3 2 
4 5 + 3 
5 6 5 4 
m m+1 m m-1l 
The Projection 
Space in Space in Number of Configuration of the pro- Configuration of the 
which the whichthe coordinate jection of each hyperplane total projection 


hypertetra- projection hyperplanes 


hedron lies lies that project 
3 2 3 one line in 1-space a triangle 
4 3 a three lines in 2-space a tetrahedron 
5 a 5 six lines in 3-space a 4-space hypertetrahedron 
m m-1 m a (m-1)-space hyper- 


(m-1) (m-2) 
mile 


tetrahedron 


lines in (m-2)-space 
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Representation on a Plane 





In addition to defining the projections of the 
m-space configuration, it is necessary to find 
a way of representing each one on a plane. 
This is no problem for the projection of the 
three-factor problem, since its model, the 
tetrahedron, projects as a plane figure. The 
4-space hypertetrahedron, however, projects 
as a 3-space figure--the skeleton or outline of 
a tetrahedron. This could be constructed in 
space, but the projection of any higher-space 
figure could not, since the projection of any 
one of these occupies at least a 4-space. A 
plane representation is also needed in dealing 
with the factor problem. To secure one, we 
may cast the shadow, so to speak, of the pro- 
jection onto a plane surface. The diagrams 
of Figure 1 illustrate plane representations 
of the projection of a 4-space hypertetrahedron. 
Since m is four, four points are specified and 
then connected pair by pair with straight lines. 
Each of these diagrams is a familiar figure, 
since it is a line drawing of a tetrahedron in 
perspective. Although the six lines actually lie 
in a plane, they can be ‘‘seen’’ to outline a 3- 
space figure. 


The reason for the three diagrams in Figure 1 
can be explained readily. If the projection lies 
in a three or higher space, there is a large num- 
ber of possible representations on a plane of 
the same figure. Although each such represen- 
tations would be different from every other, in 
an absolute sense, there would be many that 
would show the same relative arrangement of 
lines and points. The three distinctive arrange- 
ments of the representation of the tetrahedron 
are shown as Figures la, 1b, and ic. Three 
orthogonal axes may be placed in a 3-space. 
Each representation in Figure 1 gives the fig- 
ure in perspective from a point on one of these 
three axes. The number of mutually orthogonal 
planes that can be erected in the space occupied 
by the projection gives the number of distinctive 
relative arrangements of the points and lines 
of the representation. Therefore, the projection 
of an m-space hypertetrahedron may be repre- 
sented on a plane in (m - 1) (m - 2) distinctive 

2 
ways. Each such representation will also be 
seen to be a rotation through one dimension of 
of any other representation. Letters are used 
to designate the four apexes in the diagrams of 
Figure 1 so that this rotation may be noted. 


By joining points with straight lines, therefore, 
the projection may be represented on a plane in 
a certain number of distinctive ways. The num- 
ber of points is given by m, since this is the 





number of lines joining the origin and each of 


the other apexes of the m-space figure that pro- 

ject into the (m - 1)-space. The number of 

lines in the projection is given by (m) (m - 1), 
a ine 


since this is the number of distinctive planes 
making up the m coordinate hyperplanes of the 
m-space hypertetrahedron that project into the 
(m - 1)-space. This is true because a plane in 
the m-space hypertetrahedron will be determ- 
ined by the intersection of m - 2 hyperplanes. 
Since planes formed by the intersections of only 
the m coordinate hyperplanes will project, the 
number of such planes is given by the combina- 
tion of m things taken m - 2 at a time, which re- 
duces to (m)(m- 1). The projection of a 5- 

2 
space hypertetrahedron thus may be represented 
on a plane by five points joined by ten straight 
lines; of a 6-space hypertetrahedron by six 
points joined by fifteen straight lines; and so on. 


Factor Patterns 





A factor pattern will be defined as the coef- 
ficients of a set of linear equations describing 
the composition of the variables in terms of the 
factors. This is Holzinger’s definition.4 These 
coefficients also are the Cartesian coordinates 
of the variables with respect to the axes defined 
by the factors. A factor pattern may be repre- 
sented as a matrix of size n by m, where n des- 
ignates the number of variables and m the num- 
ber of common factors. In this representation, 
uniqueness is disregarded. The term ‘‘primary 
factor pattern’’ will designate that the variables 
are described in terms of primary factors. 
Since the major problem of this paper is the 
geometric formulation of the primary factors, 
no attempt is made to define thema priori. How- 
ever, the primary factors will be permitted to be 
oblique, as a general case, with orthogonal pri- 
maries as a special case. Three types of pri- 
mary factor pattern will be considered in this 
paper. Each is defined in terms of the maxi- 
mum complexity of any variable. Complexity 
refers to the number of non-zero coefficients 
in the linear equation describing the variable in 
terms of the factors. Thus a variable of com- 
plexity one will have only one such coefficient, 
a variable of complexity two, two such coeffic- 
ients, etc. The cluster pattern will be defined 
as a pattern in which the maximum complexity 
of any variable is one, a 2-multiple-factor 
pattern as one in which the maximum complex- 
ity of any variable is two, and a 3-multiple- 
factor pattern as one in which the maximum 
complexity of any variable is three.5 Ficti- 
tious exampies of these three types of pattern 
may readily be constructed by specifying num- 
erical values for the non-zero coefficients. 

The centroids they generate may then be com- 





4. Karl J. Holzinger and Harry H. Harman, Factor Analysis (Chicago: University 


of Chicago Press, 1941), pp. 16-18. 





o 
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puted and the vectors extended to unit correla- 
tion with the first centroid. Plots of the var- 
ious pairs of the columns, other than the first, 
of this extended-vector matrix then give the 
complete extended-vectors representation of 
the pattern. Since the extended-vector repre- 
sentation of three-factor patterns is well 
known, fictitious patterns for a four-factor 
problem will be illustrated.§ 


The cluster pattern, when m equals four, 
may be represented: 


Pj Po Ps P4 


A + 0 0 0 
B 0 + 0 0 
c 0 0 + 0 
D 0 0 0 + 


where P; , Po, Fy , Pg designate the four prim- 
ary axes, and A, B, C, D each designate a 
colinear cluster of tests. Each test in a given 
cluster is of complexity one, designated by the 
single plus sign in that row. The entries in 

the matrix of intercorrelations of the four pri- 
mary axes, @, need meet only the restrictions 
that are imposed upon a matrix of intercorrel- 
ations of unit vectors that are linearly indepen- 
dent; this permits the case of orthogonal pri- 
maries, or Holzinger’s uni-factor pattern. The 
three extended-vectors diagrams of the cen- 
troids generated by such a factor pattern are 
given in Figure 2. 


It is not surprising that each diagram reveals 
only the extensions of the four clusters. Two 
points are of importance here. The four clus- 


JOURNAL OF EXPERIMENTAL EDUCATION 





339 


ters have different positions relative to each 
other in each of the three diagrams. Figure 
2b, for example, represents a rotation through 
one dimension of the (hypothetical) line joining 
clusters A and B as represented in Figure 2a. 
Figure 2c is related to Figure 2b and to Fig- 
ure 2a in a similar fashion. These, then, rep- 
resent the three distinctive views possible in 
the (m - 1)-space of the extended-vectors rep- 
resentation of a four-factor problem. Second, 
each of these diagrams suggests the projection 
of the assumed hypertetrahedron. If straight 
lines were used to join each pair of the four 
clusters, the postulated projection represented 
in Figure 1 would be reproduced. In other 
words, the model is consistent with the pro- 
jections of a cluster pattern, but these projec- 
tions define only limited regions of this figure. 


The fact that this pattern limits the complex- 
ity of each test to one accounts for this and 
also defines the particular regions as regions 
of complexity one. 


Viewed from the point of view of the primary 
factor pattern, these regions of complexity one 
must be the primary axes. Further, the pri- 
mary factor pattern postulates that each pri- 
mary axis is a vector, rather than say, a plane 
or a solid, that is coordinate with the common 
origin of the test vectors. It was seen above 
that such distinctive vectors are located in the 
m-space hypertetrahedron by the intersections 
of the various combinations of m - 1 of the m 
coordinate hyperplanes. It was also shown that 
such vectors project as points that form the 
apexes of the projection. Equating the regions 
of complexity one with the primary axes is 
therefore consistent. It follows that any test 
vector that lies in a region of complexity one 








5 


See Holzinger and Harman, op. cit., pr. 106-108, for a summary of types of 
preferred vatterns. I have taken the liberty of generalizing their uni- 
factor pattern to the oblique case and calling it a cluster pattern. I also 
have split the multiple-factor pattern into more than one type, depending 
upon the extent of the overlap of the group factors, and used numbrals to 
designate these. Thurstone's criteria for simple structure, op. cit., 

pp. 334-335, may also be stated in terms of maximum complexity. 


One is the case of an initial orthogonal 


Two cases deserve consideration. 
The other is the case of an initial oblique 


solution, such as the centroid. 
solution. In order to simplify the treatment, only the former is discussed 
here. The principles, however, are general and apply to the extended-vectors 


representation of an initial oblique solution as well. This fact leads to an 
interesting variant of Holzinger's simple method of factor analysis; dis- 
cuesion of it will be reserved for a subsequent paper. 


Only a schematic representation is used in this paper. In preparing the 
diagrams, however, numerical entries were substituted for the plus signs, 
the correlation matrix calculsted, and the centroids extracted. The primary 
factor pattern, P, and the intercorrelations of the primaries, 0) » yield 

R, as follows; 


PQOP'=R 


See Holzinger and Harman, op. cit., Chapter XI and Appendix B.11. 
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lies in the intersection of m - 1 of the coord- 
inate hyperplanes of the m-space hypertetra- 
hedron, If it does, then it necessarily has 
zero coordinates with respect to the axes form- 
ed by the other intersections of the various 
combinations of m hyperplanes taken m - 1 at 
a time, i.e., with respect to the other primary 
axes. Viewed in Thurstone’s manner, such a 
test vector is seen to have zero correlations 
with the normals to the m - 1 hyperplanes at 
whose intersections it lies. Consequently, it 
has a non-zero correlation with a normal to 
only one hyperplane--the one not included in 
the group of m - 1. 


The regions of complexity two are also of 
considerable interest. A limited 2-multiple- 
factor pattern will be used to illustrate these. 
The limitation is that only four of the six pos- 
sible combinations of complexity two will be 
included, thus: 


Py P9 P3 P4 


A + 0 0 0 
1 + + 0 0 
B 0 + 0 0 
2 0 + + 0 
Cc 0 0 + 0 
3 0 0 + + 
D 0 0 0 + 
4 + 0 0 + 


The three projections of this pattern are given 
in Figure 3. Straight lines have been drawn 
through the variables indicated by numerals 
and joining pairs of clusters. These lines pass 
through all points representing the four given 
combinations of complexity two. Again, the 
three plane representations, each a rotation 

of another through one dimension, are evident. 
Though incomplete, the projection of the hyper- 
tetrahedron is clearly suggested, and we may 
conclude that the postulated model is consistent 
with this type of factor pattern. In order to 
complete the projection it is necessary only to 
add variables representing the remaining two 
combinations of complexity two. If this is done, 
their projections lie between A and C and be- 
tween B and D on each diagram; these then 
locate the remaining two straight lines needed 
to complete the figure. 


A region of complexity two constitutes a plane 
in the m-space hypertetrahedron that is pro- 
jected as a line. It is bounded, so to speak, by 
two regions of complexity one, or two of the 
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vectors that are formed by the intersection of 

m - 1 of the coordinate hyperplanes. In other 
words, it is a plane that spans twoof the primary 
axes. Test vectors lying in such a plane must 
have zero coordinates with respect to all pri- 
mary axes except those two. Since such a plane 
has been defined above as the intersection of 

m - 2 hyperplanes, a test vector lying in this 
plane is therefore uncorrelated with the normals 
of m - 2 of the coordinate hyperplanes. In gen- 
eral, a test vector lying in a region formed by 
the intersection of m - x of the coordinate hyper- 
planes of the m-space hypertetrahedron has a 
complexity of x, i.e., its primary pattern will 
have non-zero coordinates with respect to x of 
the primary axes. A variable of complexity m is 
therefore a non-distinctive variable (for that 
factor pattern) since its projection cannot con- 
tribute to the delineation of any hyperplane. 


Figure 4 gives a schematic representation of 
all the regions defined by a four-factor pattern 
in which the maximum complexity of any vari- 
able is three. This would be, in the notation 
used previously, a 3-multiple-factor pattern. 
A, B, C, and Dare each of complexity one, and 
1, 2, 3, and 4 are of complexity two, as before. 
The remaining combinations are: 


Py Po P3 P4 


5 + 0 + 0 
6 0 + 0 + 
w + + + 0 
x + + 0 + 
y + 0 + + 
z 0 + + + 


In combination with A and C, 5 defines a 
straight line on the projection. This line is 

the region of complexity two, P; - Pg, and the 
line joining B, 6, and D is the region of com- 
plexity two, Pg - P4. It should be clear that 
any number of variables might lie along and 
consequently define such lines in the projec- 
tion of actual problems. The four regions of 
complexity three are indicated by the four tri- 
angles formed by the six straight lines. Thus 
the lines forming the triangle ABC define the 
region w, of complexity three, on each diagram. 
Again, many variables might lie in such regions 
in actual problems. Variables of complexity m 
(four, in this case) have been called non-distinc- 
tive variables, since their projections do not de- 
fine any of these distinctive regions. They may, 
for example, lie either within or without the 
four-sided figure, ABCD. In the latter case they 
have at least one negative pattern value. 
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The five-factor problem may be interpreted in 
a similar fashion. In the extended-vector repre- 
sentation, five points define the regions of com- 
plexity one, and ten straight lines joining the 
various pairs of these points define regions of 
complexity two. The regions of complexity three 
are seen to be triangles in the plane represen- 
tation, and the regions of complexity four, 
four-sided plane figures. For an illustration, 
the reader should examine Thurstone’s fic- 
titious five-factor problem that he used to illus- 
trate his conception of the use of the extended- 
vectors representation in solving the rotation 
problem.° The regions of complexity one, two, 
three, and four for this hypertetrahedron can be 
recognized readily. If a factor pattern similar 
to those discussed in this paper fits a given 
six-factor problem, then the projection of the 
6-space hypertetrahedron should be consistent 
with the configuration of extended test vectors 
on the various diagrams. In theory, at least, 

m may be any number; the practical problem of 
recognizing these projections when m is large 
is another matter. 


Applications and Further Froblems 





The major point of this paper is that a partic- 
ular type of configuration can be shown to be 
a satisfactory geometric model for certain 
types of primary factor pattern. Such a postu- 
late should make possible several develop- 
ments and applications. For one, this postu- 
late should clarify considerably the problem of 
‘‘what to look for’’ in attempting a primary solu- 
tion of a given matrix, R. A factor pattern may 
be regarded as an hypothesis to be tested by the 
empirical data of a correlation matrix. If a 
cluster pattern or multiple-factor pattern of 
maximum complexity less than m is a good fit 
for the empirical data, then the set of diagrams 
of the extended-vectors representation of the 
test configuration will reveal this by their con- 
sistent correspondence with the projection of 
the m-space hypertetrahedron. For a config- 
uration of test vectors made up of exactly m 
distinct clusters, the choice of a new coordinate 
system--which constitutes the rotation problem 
--can be accomplished by taking the m points 
defined by the projected clusters, plus the origin 
as the m + 1 points of the simplex of reference, 
or new coordinate system.9 The model is here 
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characterized by its correspondence with the 
configuration of test vectors with respect to 
regions of complexity one only. 


Other problems are less simple. For them, 
the choice of a new coordinate system may be 
seen as posing this problem of correspondence 
with respect to regions of complexity greater 
than one. If the factor pattern is to be made 
up only of positive or zero coefficients, then it 
is evident from this development that the exten- 
ded-vectors representation of the test configur- 
ation must be bounded by regions of complexity 
two, whose intersections determine the regions 
of complexity one. Further, the regions of 
complexity three must also be bounded by reg- 
ions of complexity two, as must regions of com- 
plexity four, five, etc. This view, then, lays 
great stress on identifying the test vectors 
that lie in the planes that span the primary 
axes.!0 Not all students will judge this to be a 
desirable emphasis. Some, for example, will 
prefer to take certain variables or clusters of 
variables as factorially ‘‘pure’’ and then 
analyze all the variables in terms of these, 
regardless of the maximum complexity or 
the signs of the coefficients this gives to the 
pattern. Both approaches are undoubtedly 
useful. If one chooses the former approach, 
however, then this postulate of the m-space 
hypertetrahedron and its projection should 
offer some help in the identification of the 
new coordinate system from the set of extend- 
ed-vectors diagrams. 


This postulate also provides a needed geo- 
metric basis for the calculation procedure of 
direct rotation to primary structure. This pro- 
cedure has been given in detail aoastanaiee,t 
Briefly, if the plane representation of the pro- 
jection of an m-dimensional hypertetrahedron 
is consistent with the empirical data given 
as the extended-vectors representation of the 
centroids, then it should be possible to identify 
the m regions of complexity one. Each of 
these is represented by points whose coordin- 
ates are consistent on the various diagrams. 
These coordinates are direction numbers which, 
when normalized, give the transformation 
matrix, T, that transforms the centroids direct- 
ly into the primary structure. Both the inter- 
correlations of the primary axes and the pri- 





8, Thurstone, op. cit., pp. 232-236. Onl 


and therefore the remaining two should be constructed. 


y four of the six diagrams are given, 
It is now painfully 


clear that some of my earlier statements regarding the “proper” appearance 


of these projections were in error. 
to Primary Structure," Journal of Educ 


See Chester #. Harris, "Direct Rotation 


ational Psychology, XXXIX (December 





1948), pp. 449-468. 


9. See Sommerville, op. cit., p. 55. 
10. 

vectors representation (op. cit., Chap 
ll. Harris, op. cit. 


This point emphasizee the difference between Thurstone's view of the extended- 


ter XI) and mine. 
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mary pattern may be calculated from these 
data. The obvious advantage of the procedure 
is that only a single rotation is made. The 
geometric formulation given in this paper de- 
fines these regions of complexity one as the 
projections of the primary axes. It also de- 
fines the primary axes as vectors formed by 
the intersection of m - 1 hyperplanes. The 
procedure of taking the coordinates of the 
regions of complexity one as the direction 
numbers of the primary axes thus follows 
logically. 


An important direction in which inquiry 
might turn is that of the types of figure that 
constitute the models for other types of factor 
pattern. It is well known that the factor solu- 
tion is indeterminate and that one pattern may 
be transformed into another within the restric- 
tions of certain boundary conditions. It is not 
known, however, how many different types of 
factor patterns are distinctive in the geometric 
sense of demanding different types of figure as 
the model. For example, the factor pattern 
that consists of a general factor plus bipolar 
factors might be studied in this fashion. One 
suspects that some such patterns are not 
uniquely bipolar, since their projections would 
be consistent with the projection of the hyper- 
tetrahedron. On the other hand, other bipolar 
patterns may be unique in the sense that a 
different model is required. This line of 
inquiry might lead eventually to the definition 
of distinctive types of correlation matrices 
in terms of geometric models. 


Another avenue of inquiry may lead to an 
analytic solution in terms of primary factors. 
Certain suggestions grow out of the develop- 
ment given here. Any region of complexity two 
appears as a straight line, bounded by two 
points of complexity one, on the extended-vec- 
tors representation. Further, this region is a 
straight line on all the diagrams, and all test 
vectors that project on such a line maintain the 
same position relative to each other and to the 
two boundaries on all diagrams. Assume that 
it is possible to specify, a priori, one of the 
regions of complexity one. This can often be 
provided for in designing an actual factor study 
by including a cluster of tests that are known to 
be relatively ‘‘pure’’ factorially. This region 
of complexity one may then be made the origin 
for the extended-vectors matrix; all that is 
necessary is to subtract from each column the 
coordinate of that specified point. It then 
follows that all variables that consistently lie 
on a line with that point will have a constant 
slope for each of the (m - 1) (m - 2) pairs of 

2 


columns. Assume a five-factor problem, 
with We Xj» Vj» and zj as the non-zero extend- 


ed-vectors coordinates when referred to the 
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specified region of complexity one as origin. 
The six slopes, for any variable j, are: 


“. “i, “i, “and 9) 
Zz; 


x 
7.395 43.5 *j 


All variables defining a region of complexity 
two must meet the criterion that the sum of 
these six slopes equals a constant. This can 
be written: 


K = aj + (aj + 1)b; +[(a, + 1)b; + Je, 


x ‘ 
where aj 1, bj ot, and cj , Such a 
j yj - 
constant can be defined for m of any size. 
When expressed in the form given here, it will 
always have exactly m - 2 terms. Such acriter- 
ion, K, will identify all groups of variables that 
lie in straight lines, on all the diagrams, with 
the point taken as the arbitrary origin or reg- 
ion of complexity one. In an ‘‘ideal’’ problem, 
such a procedure would identify m - 1 of the 
groups of variables whose projections lie in 
these regions of complexity two. 


Variables lying in a region of complexity 
three must be characterized by a slope, with 
respect to any pair of axes of the plane repre- 
sentation, that consistently lies between limits 
given by the slope of variables of complexity 
two that bound the region of complexity three. 
A method of sorting variables into those that 
may and those that may not lie in a given 
region of complexity three is therefore given. 
The principle of such a sorting process may 
be extended to regions of complexity of m - 1. 
Sorting for variables that lie in regions of 
complexity three should also suggest one or 
more groups of variables that, since they may 
not lie in a region of complexity three, may 
determine one of the unknown regions of com- 
plexity two. Such an hypothesis would then 
lead to solving the equations of the lines de- 
lineated by the groups of variables of constant 
slope for their consistent points of intersec- 
tion. In this manner, an analytic solution might 
be built up. This sketch of the procedure mere- 
ly suggests possibilities. A rigorous treatment 
and careful attention to the problem of a satis- 
factory measure of sampling fluctuation of the 
criterion K and of the slopes making up this 
criterion are needed in order to provide a 
sound statistical basis for such a procedure. 
Further, these present suggestions are all 
limited by the requirement that one region of 
complexity one has been specified a priori. 


It seems plausible that the reasoning given 
in this paper should lead eventually to the 
formulation of certain conditions that must 
hold for R if a primary factor pattern of the 
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type illustrated provides an adequate fit. The 
determination of these conditions may then 
lead to a primary solution that is derived 
directly from R, rather than from an inter- 
mediate factor matrix. At present, it appears 
that these conditions would have to be ex- 
pressed as relatively complicated relation- 
ships in which the unknown communalities 
would play an important role, For example, 

a necessary condition for test vectors that lie 
in a plane is that the angular distance, express- 
ed in radians, e.g., between the two variables 
lying farthest apart is exactly equal to the sum 
of the angular distances between each pair of 
variables taken in sequence with respect to 
their position. The inclusion of any variable 
outside this plane in the group would destroy 





this relation, since the great circle is the 
shortest path between two points on a sphere. 
Angular degrees of separation of test vectors 
can be transformed into cosines of these angles, 
and the cosines into correlation coefficients 
divided by the respective lengths of the pair of 
vectors.!2 Thus, such a condition could be 
written in terms of the correlation matrix 
entries. Further exploration would be needed 
in order to determine the additional conditions 


that yield the (m) (m - 1) planes that intersect 
2 


in m common points. Exploration of the possi- 
bility of writing these conditions and then of 
making use of them to secure a direct solution 
would seem to be warranted, however. 





12. This assumes that m is to be taken as the minimum rank of R. 
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THE DEVELOPMENT OF A TECHNIQUE FOR THE 
MEASUREMENT OF SOCIAL-EMOTIONAL 
CLIMATE IN CLASSROOMS 


JOHN WITHALL * 
University of Chicago 


Rationale.--In the study! proper a brief over- 
view of current theories of learning from the 
associationist and field-theorist point of view 
precedes a statement of the major concepts, 
gleaned largely from the field-theorists, that 
guide the study. Certain postulates regarding 
the motivational factors in personality and re- 
garding learning conditions are proposed, The 
primary motivational force of human behavior 
is postulated to be a drive toward self-actual- 
ization. This drive is said to be influenced by: 


1. Need for self-consistency 

2. Interaction in terms of an internal 
frame of reference 

3. Self-directive behavior 

4. Achievement of personal significance 
and private meanings in a social 
milieu. 


It is postulated that learning (changes in be- 
havior) is most likely to occur when experiences 
are both 


1. meaningful to the learner, that is, are 
areal by the learner as pertinent to 
his needs and purposes, are consistent 
with his personality organization, and 
are associated with self-directive be- 
havior; and 


2. occur in a non-threatening situation, 
that is, the learner is free from a sense 
of personal threat, interacts with others 
in a wholesome social milieu, and is 
helped to evaluate himself on the basis 
of objective criteria. 








Since condition number 1 is postulated to be 
dependent on condition number 2, some knowl- 
edge of the psychological atmosphere represent- 
ed by number 2 is necessary. Little objective 
evidence exists regarding psychological atmos- 
pheres either in learning or other situations. It 
was decided, therefore, to attempt to develop a 
technique to measure social-emotional climate 
in the classroom through a categorization of 
teacher-statements. It seems reasonable to 
assume that the teacher’s behavior influences 
the conditions of learning since she is placed 
in the classroom by society to manipulate the 
conditions so as to facilitate learning. 


Assumptions and hypothesis.--The basic as- 
sumptions of the study are (1) that the social- 
emotional climate is a group phenomenon; (2) 
that the teacher’s behavior is the most impor- 
tant single factor in creating climate in the 
classroom, and (3) that the teacher’s verbal 
behavior is a representative sample of her 
total behavior. 





The hypothesis to be tested is that by means of 
a categorization of teacher-statements a valid 
and reliable index of social-emotional climate 
can be obtained. 


Work Previously Done in Area of Climate 





Some work has been done in the area of clim- 
ate, notably by Ronald Lippitt at lowa Univer- 
sity and Harold H. Anderson at Michigan State 
College. 





* The author is indebted to several members of the faculty of the Department 
of Education at the University of Chicago for their counsel and construc- 


tive criticisms during the carrying out of the study reported here. 


He 


is particularly grateful to both Dr. Herbert A. Thelen of the Department 
of Education and the Department of Psychology, and to Dr. Carl R. Rogers 
of the Department of Psychology, University of Chicago, for their facil- 
itation of the exverimenter's efforts to make the study as ecientifically 
sound and as theoretically fruitful as possible, 
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Lippitt’s Study 


Lippitt’s work in social climate is best repre- 
sented by his doctoral study.2 In that study he 
organized four clubs of five boys each and gave 
each club successive experiences with an ‘‘aut- 
ocratic’’ and ‘‘democratic’’ leader during three 
consecutive six-week periods. Several leaders 
were used to head each of the clubs. The lead- 
ers were required to employ different leader- 
ship styles with each successive group. The 
leadership styles were implemented in accord 
with certain specific criteria drafted to guide 
the club leaders. Records of social interac- 
tion between group members and leader, steno- 
graphic records of conversation in each club, 
analysis of activity subgroupings, and a running 
account of psychologically interesting interac- 
tion in each group were among data collected 
by observers of each club session. 


The major conclusions of Lippitt’s study were: 


1. That different leadership styles pro- 
duced different social climates and re- 
sulted in different group and individ- 
ual behaviors; 


2. That conversation categories differen- 
tiated leader-behavior techniques more 
adequately than social-behavior cate- 
gories; 


3. That autocratic leadership elicited 
either an aggressive rebelliousness 
towards the leader or an apathetic 
submission to the leader; 


4. That leadership style was the primary 
factor in producing climatological dif- 
ferences and that club personnel was 
of secondary importance. 


Lippitt’s work represents one of the earliest 
and most significant attempts to observe and 
control the climate variable in a group situation. 
His findings regarding the value of categorizing 
verbal behavior as a means of assessing the 
quality of group life provides a sound basis for 
the methodology of this study which utilizes a 
categorization of teachers’ verbal behaviors as 
its major technique. 


Anderson’s Work 


Harold H. Anderson, Joseph E. Brewer and 
others have conducted at Michigan State College 





investigations$ into the influence of teachers’ 
classroom personalities on children’s behavior, 
particularly at the primary and elementary 
school levels. In order to obtain objective 
measurements of teachers’ classroom person- 
alities and concomitant children-behavior, 
Anderson and Brewer developed twenty-six 
teacher-behavior categories and twenty-nine 
children behavior categories by which both 
teacher and pupil verbal and non-verbal be- 
havior might be categorized. Anderson divid- 
ed teacher-behaviors into two main kinds -- 
Integrative Teacher Behavior and Dominative 
Teacher Behavior. Integrative behavior was 
that which expanded the children’s opportunities 
for self-directive and cooperative behavior 
with the teacher and their peers; dominative 
behavior tended to restrict children’s activi- 
ties and to lead to distracted, aggressive, non- 
cooperative conduct. Anderson demonstrated 
that children’s behaviors were consistent with 
the kind of personality the teacher displayed in 
the classroom. 


Anderson’s studies bring out evidence that is 
highly pertinent to the hypothesis that the main 
direction of influence in the classroom is from 
the teacher to the pupil. He has demonstrated 
too, that reliable patterns of teacher and pupil- 
behavior can be obtained in the classroom 
through categorizations of their overt behaviors. 


Development of Climate Index 





The concept of climate or psychological atmos- 
phere has been used by others in the area of 
psychology and education besides Lippitt and 
Anderson. Lewin, Prescott and Rogers, for ex- 
ample, have made considerable use of the con- 
cept. However, no clear-cut definition of the 
concept can be cited and for the purpose of 
more effective communication and clearer un- 
derstanding of the notion as used here, a def- 
inition of the term ‘‘social-emotional climate’’ 
is offered. 


Definition of social-emotional climate.--Clim- 
ate is considered in this study to represent the 
emotional tone which is a concomitant of inter- 
personal interaction. It is a general emotional 
factor which appears to be present in inter- 
actions occurring between individuals in face 
to face groups. It seems to have some rela- 
tionship to the degree of acceptance expressed 
by members of a group regarding each other’s 
needs or goals. Operationally defined it is 








2. Ronald Lippitt, 


f Applied Psychology Monogra 
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considered to influence: (1) the inner private 
world of each individual; (2) the esprit de co 

of a group; (3) the sense of meaningfulness 
group and individual goals and activities; (4) the 
objectivity with which a problem is attacked, 
and (5) the kind and extent of interpersonal in- 
teraction in a group. 


Procedure for identifying categories.--Pro- 
ceeding from our assumption that climate is 
largely determined by the teacher’s behavior, 
we undertook an analysis of the teacher’s verb- 
al-behaviors contained in sound recordings of 
regular class sessions. A teacher’s verbal- 
behavior is assumed to represent adequately 
her total behavior. Guided by the postulates 
regarding individual motivation and conditions 
of learning, teacher-statements were analyzed 
in order to ascertain whether the teacher was 
utilizing behaviors likely to create the postu- 
lated conditions for learning. Individual 
teacher-statements tended to fall into about 
twenty-five types of responses, e.g.: 





1. Reproof statements made with the 
apparent intent of halting the pupil’s 
present behavior and modifying future 
behavior; 


2. Questions seeking further information 
from pupil about problem; 


3. Statement offering simple administra- 
tive information, e.g., about place of 
next class meeting; 


4. Questions containing reproof and a 
plea for cooperation; 


5. Statement approving pupil's behavior 
and commending him; 


6. Statement urging a particular course 
of action on the learner; and so on. 


These twenty-five kinds of responses were 
soon found to overlap and to be not mutually 
exclusive. They were reduced to thirteen and 
finally to seven categories. These categories 
seemed to encompass all the types of state- 
ments which teachers utilized in classrooms. 
The categories are: 


1, Learner-supportive statements that 
have the intent of reassuring or com- 
mending the pupil. 


2. Acceptant and clarifying statements 
having an intent to convey to the pupil 
the feeling that he was understood and 
help him elucidate his ideas and feel- 


ings. 





3. Problem-structuring statements or 
questions which proffer information 
or raise questions about the problem 
in an objective manner with intent to 
facilitate learner’s problem-solving. 


4. Neutral statements which comprise 
polite formalities, administrative com- 
ments, verbatim repetition of some- 
thing that has already been said. No 
intent inferrable. 


5. Directive or hortative statements with 
intent to have pupil follow a recommended 
course of action. 


6. Reproving or deprecating remarks in- 
tended to deter pupil from continued 
indulgence in present ‘‘unacceptable’’ 
behavior. 


7. Teacher self-supporting remarks in- 
tended to sustain or justify the teacher’s 
position or course of action. 


It seemed that more than one continuum was 
identifiable in the seven categories; for ex~ 
ample, a continuum from problem-centeredness 
to person-centeredness might be discerned, a 
continuum from objectivity to subjectivity or a 
continuum from learner-centeredness to teach- 
er-centeredness. The latter continuum was 
accepted as useful in applying the categories 
to teachers’ verbal behaviors. Categories 1, 
2, and 3 were said to be learner-centered and 
categories 5, 6, and 7 were said to be teacher- 
centered. The neutral category had no influ- 
ence on either bloc. Certain conventions were 
suggested for interpreting patterns of verbal 
behavior; for instance, if the proportion of 
statements falling into one or more of the first 
three categories outweighed the proportion 
falling into one or more of the last three cat- 
egories, the teacher was said to be learner- 
centered, If the proportion were reversed, the 
teacher was said to be teacher-centered. Sim- 
ilarly, if the proportion of statements falling 
into category 3 outweighed the proportions 
falling into either categories 1 and 2 combined 
or categories 5, 6, and 7 combined, then the 
teacher was said to be more problem-centered 
than learner or teacher-centered. If the 
largest proportion fell into the combination of 
either categories 1.and 2 or categories 5, 6, 
and 7, then the teacher was said to be more 
highly learner-centered or more self-centered 
as the case might be. Once the seven categor- 
ies had been identified and procedures develop- 
ed for facilitating their application to data, the 
next step was to ascertain the objectivity, re- 
liability, and validity of the technique. 
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Determiaing the pbjectivity, Reliability, 
mate 


Objective Study 


The objectivity of the instrument was ascer- 
tained by having four trained judges apply the 
index independently to typescripts 1A, 2A, and 
3A comprising 68, 71, and 45 teacher-state- 
ments respectively. The percentage of agree- 
ment of each judge with the researcher was 
computed. The percentage of agreement was 
computed by the formula: 





Number of identical state- 
ments which researcher 





and judge place in a given 
Pererae® _cateory = 100 
agreement Number of statements 


which researcher places 
in same category 


The mean percentage of agreement between the 
judges and researcher on the three typescripts 
was: 


Judge 1 64 percent 
Judge 2 66 percent 
Judge 3 56 percent 
Judge 4 75 percent 


The highest mean percentage of agreement of 
the four judges with the researcher on the three 
typescripts was 78 percent on category 1 and 
the lowest 53 percent on category 7. The grand 
mean percentage of agreement for four judges 
with the researcher on the three typescripts 
65 percent. 


A further measurement of the index’s objec- 
tivity was made by computing tetrachoric cor- 
relations between the categorization of each of 
the five judges (the researcher included) on the 
three typescripts. Two hundred ten r'¢tS were 
computed to ascertain the degree of association 
between each of the judges with one another on 
each category in turn. 


The range of correlation and median rt¢t cor- 
relation of five judges’ was: 








Typescript Range Md, 
1A -.09 to .95+ 84 
2A -20 to .95+ -76 
3A 36 to .95+ -93 
Reliability Study 


The size of an adequate sample was empiric- 
ally determined by the classic procedure of 
adding further data to ascertain at what point 
addition of further data caused little or no 
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change in the obtained pattern of statements. 
Successive blocs of fifty statements from the 
typescripts of one class session were added to 
each other and the fluctuation in pattern of 
statements falling into the seven categories 
noted. (Table I) 


Standard errors were computed to ascertain 
whether the differences between the successive 
percentages in each category and the mean were 
significant, that is, were three times the stand- 
ard error. No significant differences were 
found except with regard to the percentage of 
category 3 statements based on the first fifty 
and first one hundred statements respectively, 
and for the first fifty statements of category 4. 
The other 32 differences were not reliable. 


Tt was concluded that 200 statements would 
offer an adequate sample of a given teacher’s 
statements since the statements processed rep- 
resented a class session drawn at random from 
several class sessions. The class from which 
the sample was drawn was, as regards size of 
group, resources, classroom arrangement, et 
cetera, in no way peculiar or biased. 


A further check on the consistency of the in- 
dex was made by comparing the day to day 
variations of the pattern of statements of three 
teachers, A, B, andC. 


The chi-square test was used to check the 
hypothesis that no significant differences 
occurred from day to day. It was not possible 
to reject the hypothesis insofar as teachers 
B and C were concerned. The P’s for these 
samples were .14 and .42 respectively. In the 
case of teacher A, however, a P of .02 was ob- 
tained which suggested that other than chance 
factors were operative. An ‘‘explanation’’ for 
this low P might be found in the fact that sample 
7 alone contributed more than half the total val- 
ue of chi-square and was a deviate session in 
that the teacher, contrary to her usual pattern 
of behavior was compelling the class to carry 
out a project to which they objected bitterly. 
The whole tone of the class session was great- 
ly at variance with the usual tenor of this 
teacher’s session. If this sample 7 is omitted 
from the series, the P obtained for the rest is 
-54 which does not permit of the rejections of 
the hypothesis and permits us to say the differ 
ences are insignificant. 


Validation Procedures 


The validity study has employed four pro- 
cedures to validate the technique developed. 


1. We have used H. H. Anderson’s Teach- 
er Behavior Categories‘ as a criter- 
ion instrument that has been applied 
to the same data as the climate index. 
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TABLE I 


PERCENTAGE OF INCREMENT OR DECREMENT IN THE PROPORTION OF STATE- 
MENTS FALLING INTO EACH OF SEVEN CATEGORIES AS ADDITIONAL BLOCS 


OF STATEMENTS ARE ADDED 





Categories: 




















1 2 3 4 5 6 7 
% % % % % % % 
First fifty 
statements 0.0 0.0 0.0 0.0 0.0 0.0 0.0 
Fifty added 
to 50 0.0 0.0 2.0 -7.0 3.0 -1.0 3.0 
Fifty added 
to 100 2.7 0.7 3.3 -3.7 “1.7 -0.3 -1.0 
Fifty added 
to 150 -0.2 -0.7 3.7 -0.8 -2.7 -0.2 0.5 
Fitty added 
to 200 -0.1 0.0 1.0 0.4 -1.0 -0.1 -0.2 
Twenty-one added 
to 250 -1.6 -0.3 1.0 0.0 0.2 0.4 0.3 
TABLE 1 
I.D. RATIOS AND CLIMATE INDEX RATIOS ON TYPESCRIPTS 
1A, 2A, and 3A 
Typescript LD. ratio Ratio of learner-centered statements 
teacher-centered statements 

1A 6.3 2.4 

2A 1.6 2.2 

3A 0.4 0.2 
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2. We have related pupil’s recorded com- 
ments and feelings to the picture of 
climate in the classroom situation ob- 
tained by application of the climate 
index to the teacher’s statements. 


3. We have predicted from results ob- 
tained on typescripts the ratings that 
independent judges would make of 
teacher-behaviors in terms of a Teach- 
er-Characteristics Rating Scale. 


4. We have related the description of a 
classroom situation from three frames 
of reference--a categorizer using ob- 
jective criteria, judges’ratings of the 
teacher’s verbal behavior in a ‘‘live’’ 
situation and pupils registering their 
negative or positive feelings. 


Use of Anderson’s Teacher-Behavior Cate- 
gories.--Anderson has developed Teacher-Be- 
havior Categories by which to describe Teach- 
er’s Classroom Personalities. The categories 
distinguish between Integrative and Dominative 
teacher-behavior and are the basis for an 1.D. 
ratio” which represents the ratio of approving, 





cooperative, integrative behavior by the teacher. 


Tt can be shown that the I.D. ratio in many re- 
spects is similar to a proportion of learner- 
centered to teacher-centered verbal behaviors. 
LD. ratios and learner-centered to teacher- 
centered ratios are comparable since in each 
instance a ratio greater than 1.00 denotes a 
greater use of integrative, acceptant and facili- 
tating behaviors over dominative, unacceptant 
and limiting behaviors by the teacher. 


Anderson’s categories and the climate index 
categories were applied to the same data with 
the results found in Table II. 


Analyses by both Anderson’s and the climate 
index criteria appear to concur that a greater 
proportion of teacher-behaviors in the situa- 
tion represented by typescript 1A and 2A were 
integrative and acceptant whereas a greater 
proportion of teacher-behaviors in 3A were 
dominative and rejecting. 


Pupils’ evaluations.--The pupils’ evaluation 
of the teacher’s methods and of the classroom 
situation in general is admissible evidence 
which may sustain or contradict the picture of 





climate obtained in a given classroom. We have 


such pupil evaluations regarding teacher A in 


class W. We also have climate index indications 


of the major focus of concern of the teacher de- 
rived from categorizing typescripts 1A and 2A 
as well as a mean pattern obtained from 8 ran- 
dom samples drawn from the same teacher’s 
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class sessions. The climate index analysis 
shows that, on the whole, in order of impor- 
tance the teacher’s concern is first with her- 
self, second with the learner, and third with 
the problem. (Table IT) 


There is some evidence here which suggests 
that the teacher tended to place more em- 
hasis on interpersonal relationships than on 
structuring the objective problem in the learn- 
ing situation. The concensus of opinion on the 
part of the 25 pupils who filled out evaluation 
sheets was ambivalent and can be best sum- 
marized as follows: 


Approving 


Teacher was cooperative 

Teacher was unprejudiced and 
impartial 

I got new ideas 

Teacher helped me 


Disapproving 


T wasted considerable time 

Teacher ought to have guided and 
controlled class more 

I was bored 

Teaching method was unsatisfactory 


The picture obtained both from the climate in- 
dex and from the pupils’ evaluations tended to 
concur in depicting a teacher who evidenced 
more concern with personality factors in the 
learning situation than with the objective prob- 
lem elements. 


The comparison of these data afford another 
crude indication that the climate index categor- 
ization tends to represent with some accuracy 
the actual atmosphere obtaining in a classroom, 
at least, as seen through the eyes of the pupils. 


Teacher-Characteristics Scale.--Four judges 
who had not used the climate index and who had 
no knowledge of the criteria contained in the 
index were asked to scale typescripts 1A, 2A, 
and 3A according to two dichotomous lists of 
teacher-behaviors which were labelled ‘‘A’’ and 
**C”’ on a 9-point scale. The midpoint was 
marked ‘‘B’’. 





Point ‘‘A’’ was defined by such teacher-be- 
haviors as: 


1. Raising objectivite questions about 
procedures and purposes of learner 


2. Evaluating pupil-achievement in terms 
of stated, objective criteria 





4. Anderson et al., Op. cit., pp. 22-26. 
5. Anderson et al., op. cit., pp. 41-42. 
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TABLE Ill 


PERCENTAGE OF TEACHER A’S STATEMENTS FALLING INTO EACH OF 
THREE AREAS OF CONCERN 











Mean for Eight 
Typescript 1A Typescript 2A Random Samples 
Areas of Concern Percent* Percent* Percent 
Sustain herself 
(Categories 5, 6, 
and 7) 28.0 17.9 37.4 
Helping the 
learner (Categories 
1 and 2) 44.0 13.3 23.4 
Developing the 
problem (Cate- 
gory 3) 22.1 4.4 28.8 








*The remaining percentage in each instance is accounted for by the neutral 
category, number 4. 


TABLE IV 


COMPARISON OF WEIGHTED RATINGS OF TEACHER-CHARACTERSTICS 
RATING SCALE WITH CLIMATE INDEX RATIO OF LEARNER-CENTER- 
ED TO TEACHER-CENTERED STATEMENTS ON TYPESCRIPTS 1A, 




















2A, and 3A 
Weighted Rat ings Learner-centered 
Types || Jude | Iudge | Judge | Judge [Ratio of Toa cher-conlered 
statements 
1A 0.0 1.3 3.0 3.7 2.4 
2A 3.0 1.0 0.8 3.0 2.2 
3A -2.5 0.5 -2.0 -3.5 0.2 
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3. Helping learners to identify a wide 
choice of problem-solving activities, 
etc. 


Point ‘C’’ was defined by such teacher-be- 
haviors as: 


1, Limiting choices of behavior of the 
learner 


2. Evaluating the learner’s achievement 
on the basis of the teacher’s unstated, 
subjective criteria 


3. Setting the pace of problem-solving 
behaviors for the learner, etc. 


Ten such behaviors for ‘‘A’’ and ‘“‘C’’ were 
listed. The training given the judges was com- 
paratively brief--a forty minute session--and 
then they were asked to apply the scale to type- 
scripts 1A, 2A, and 3A. From the categoriza- 
tion of statements on the basis of the climate 
index criteria, it was predicted that the judges 
would place their ratings on the 9-point scaie 
between ‘‘A’’ and ‘‘B”’ for typescripts 1A and 
2A, and between ‘‘B’’ and ‘‘C’’ for typescript 
3A. The results were that the ratings were as 
predicted for typescripts 1A and 2A except that 
on 1A Judge 5 placed his rating exactly at ‘‘B’’ 
and were as predicted for 3A except that Judge 
6 placed his rating on the ‘‘positive’’ side of 
**B’’ instead of on the ‘‘negative’’ side as pre- 
dicted. Positive weights were given to the 
points from ‘‘B’’ to ‘‘A’’ with ‘‘B’’ marked 
zero and ‘‘A’’ marked plus 4; negative weights 
were given the steps from ‘‘B’’ to ‘‘C’’ being 
weighted minus 4. The resultant weighted 
ratings are represented in Table IV. 


Description of class situation from three 
frames of reference.--A further validation 
procedure involved the relating of data describ- 
ing an experimental classroom situation from 
three frames of reference: 








1. The objective frame of a categorizer 
using the criteria of the climate index 


2. The frame of observers rating teacher- 
statements in situ with the aid of illus- 
trative teacher-behaviors on a dicho- 
tomous scale 


3. The frame of the pupils in the class- 
room 


The experimenta! session was forty-five min- 
utes long. In the first part of the period, the 
teacher sought by means of his verbalizations 
to create a learner-centered climate and in 
the last part of the period a teacher-centered 
climate. 
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The data that are reported here refer to 
session Number 2 of Group ‘‘X’’ comprising 
ten pupils. In the categorization of teacher- 
statements by means of the climate index, it 
was found that of the eighty-three statements 
in the learner-centered part of the session: 


48.2 percent were learner-centered 
12.0 percent were teacher-centered 
39.8 percent were neutral 


Of the thirty-three neutral statements com- 
prising 39.8 percent neutral statements, 
twenty-one were simple verbalizations of 
pupils’ names. 


Of the 119 statements in the teacher-center- 
ed part of the session: 


31.9 percent were learner-centered 
55.5 percent were teacher-centered 
12.6 percent were neutral 


Of the fifteen neutral statements comprising 
the 12.6 percent, four were simple verbali- 
zations of pupils’ names. 


There is an obvious difference between the 
learner-centered and teacher-centered parts 
of the class session as regards the percent- 
ages of teacher-centered statements. The 
difference, however, between the proportions 
of learner-centered statements is not large 
quantitatively but differs considerably quali- 
tatively. For example, the percentage of 
category 1 remarks is nil and the bulk of the 
learner-centered statements falls into cate- 
gory 2 during the learner-centered portion of 
the lesson, whereas in the teacher-centered 
half of the lesson 24.4 percent of the total 
31.9 percent of learner-centered statements 
fall into category 1. These category 1 state- 
ments, described as learner-supportive, were 
mostly superficially commendatory remarks 
such as ‘‘Right!’’ ‘‘Good!’’ and ‘‘ Very good!’ 


The categorizations by means of the climate 
index show that there was a preponderance of 
learner-centered remarks over teacher-cent- 
ered remarks in the first portion of the session 
and a preponderance of teacher-centered over 
learner-centered remarks in the second por- 
tion of the session. 


During the class session proper, four judges 
who were trained therapists rated the teacher’s 
statements in terms of illustrative criteria 
drawn from the Teacher-Characteristics Rat- 
ing Scale. One end of the scale was designated 
**A’’, the opposite end ‘‘C’’ and the midpoint 
‘*B’’. If a judge rated a statement as falling 
between ‘‘A’’ and ‘‘B’’, it was interpreted as 
being learner-centered; if he rated between 
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TABLE V 


MEAN PERCENTAGES OF JUDGES’ RATINGS OF TEACHER-STATEMENTS 
USED DURING LEARNER-CENTERED PART OF SESSION 





Mean 





Percent Kind of Statement Range of 4 Judges’ Percentages 
63.8 learner-centered 45.0* to 77.7 

18.6 neutral 10.0 to 22.3 

17.6 teacher -centered 0.0 to 45.0* 





* Judge 4 was responsible for the 45 percent item in both instances. He 
explained afterwards that his bias had been to scale objective questions 
which sought information as probing-directive statements; the other judges 
interpreted them as problem-structuring and therefore as learner-centered 
comments. 


TABLE VI 


MEAN PERCENTAGES OF JUDGES’ RATINGS OF TEACH- 
ER-STATEMENTS USED DURING TEACHER-CENTERED 
PART OF SESSION 








Mean Range of 4 Judges’ 
Percent Kind of Statement Percentages 
2.9 learner-centered 1.5 to 5.7 
6.9 neutral 1.5 to 12.8 


90.2 teacher-centered 81.5 to 97.0 
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**B’’ and “‘C’’, it was considered to be teacher- 
centered; if he rated it at ‘‘B’’, it was inter- 
preted as a neutral statement. 


In the learner-centered part of the session 
the mean percentages of the four judges’ ratings 
of teacher-statements are found in Table V, 
and in the teacher-centered portion of the 
session the mean percentages of the four 
judges’ ratings are found in Table V1. 


The data arrived at from an objective cate- 
gorization of statements by means of the clim- 
ate index and that obtained from four judges’ 
ratings of statements in situ are in general 
agreement. 


For the experimental class periods each 
of the ten pupils of Group ‘‘X’’ was supplied 
with two push buttons; one labeled plus and the 
other minus. They were requested, (1) to push 
the plus button whenever they felt ‘‘good’’ at 
what the teacher said whether the remark was 
directed at them or not, and (2) to push the 
minus button whenever they felt badly at what 
the teacher said. An orientation session was 
held with the group prior to the experiment 
proper to get them acquainted with the equip- 
ment and procedure but no training was given 
them regarding the kinds of statements which 
were to evoke either negative responses or 
positive responses. 


It was the prediction of the researcher that 
more positive than negative reactions from 
the pupils would be registered in the learner- 
centered portion of the session, and more 
negative than positive reactions in the teach- 
er-centered part of the session. The recorded 
pupil reactions are found in Table VII. 


The positive reactions slightly predominated 
during the learner-centered part of the class 
session and the negative reactions predomin- 
ated during the teacher-centered , art of the 
class. 


These data derived from three frames of refer- 


ence: (1) a categorizer using climate index cri- 
teria, (2) the observers’ ratings of statements 
in situ, and (3) the pupils’ ‘‘feeling’’ reactions, 
on the whole, tend to concur in depicting the 
first part of the session as probably more 
learner- and problem-oriented, with a greater 
possibility of objective attitudes by the teacher 
and pupils to the learning situation, and the 
latter part of the lesson as more teacher cen- 
tered and more conducive to a subjective ori- 
entation by the teacher and pupils to the learn- 
ing situation. 
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By the procedures outlined in the preceding 
pages, the climate index was shown to have 
objectivity, reliability, and validity. 


Application of Climate Index to Excerpts 
from Five Different Classes 








To test the applicability of the index categories 
to regular classroom sessions, samples were 
drawn from the sound recordings of five reg- 
ular classes held in the laboratory classroom. 
These classes included a 7th grade mathemat- 
ics class, two 8th grade social science classes, 
an 8th grade art class, and a 9th grade Latin 
class. The classes met regularly in the class- 
room three or four times a week and were 
conducted by the regular class teacher in her 
usual manner. 


Sampling.--Eight random samples, each 
seven minutes in length, were drawn from the 
recorded class sessions of the art, mathemat- 
ics, and Latin class. In addition, one full 
class session was taken from each of the three 
aforementioned classes as well as from the 
two social science classes. A total of 23 
seven-minute excerpts (one sample from the 
Latin class was unusable) and five full class 
sessions were available from classes at the 
secondary school level. These excerpts from 
the sound records of the several class ses- 
sions were typed-up and the seven categories 
applied to the teacher-statements contained 
in the protocols. 


Patterns of statements for each of the four 
teachers were obtained and an attempt was 
made, largely for illustrative purposes, to 
interpret these patterns. A mean pattern of 
distribution of statements among the seven 
categories was obtained for each of teachers 
A, B, and C from the several excerpts from 
their respective classes. A pattern of state- 
ments for teacher D in classes W and Z, re- 
spectively, was also obtained from two full 
class sessions of the social science classes. 
(See Table VIII) 


Comment regarding teachers’ patterns of 
statements.-~Teacher B appears to use a 
larger proportion of learner-centered state- 
ments (categories 1, 2, and 3) than the other 
three. Teachers A, C, and Dz seem to use 
a somewhat similar proportion of teacher- 
centered remarks; teacher B uses a low pro- 
pane of teacher-centered remarks (categories 
, 6, and 7). Teacher D in both sessions uses a 
slightly larger proportion of problem-struct- 
uring remarks (category 3) than the other three 
teachers and teacher A uses the lowest percent- 








6. Greater concern with one's self and one's status in the group than with 


the objective problem. 
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COMPARISON OF NUMBER AND PERCENTAGE OF POSITIVE AND 
NEGATIVE ‘‘FEELING’’ REACTIONS OF PUPILS RECORDED 
DURING THE LEARNER-CENTERED AND TEACHER-CEN- 
TERED PARTS OF A CLASS SESSION 





























Learner-Centered Teacher-Centered 
**Feelings’’ Portion of Session Portion of Session 
Number of Number of 

Reactions Percent Reactions Percent 
Positive 54 54.0 72 30.2 
Negative 46 46.0 166 69.8 
Total 100 100.0 238 100.0 

TABLE VIII 


PERCENTAGE OF STATEMENTS LOCATED IN EACH OF THE CLIMATE 


INDEX CATEGORIES FOR TEACHERS A. B, C, AND D 















































Teacher 
A B Cc Dy D, 
Mean Mean Mean Mean Mean 
Category % % % % % 
1 18.2 20.3 8.6 2.5 4.7 
2 5.2 7.8 2.8 0.0 0.9 
3 28.8 39.7 30.7 53.0 41.1 
Total 2.2 67.8 42.1 55.5 46.7 
4 10.4 22.2 20.2 21.4 16.8 
5 14.8 6.8 23.9 13.7 20.6 
6 9.1 1.6 8.9 6.8 10.3 
7 13.5 1.6 4.9 2.6 5.6 
Total | 37.4 10.0 37.7 23.1 36.5 
Grand 
Total 100.0 100.0 100.0 100.0 100.0 
=497 N=408 N=525 N=117 N=107 
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age of such comments. Teacher B uses the 
largest proportion of category 1 statements and 
of category 2 remarks. In the teacher-centered 
area, teacher D, and teacher C use the largest 
proportion of directive (category 5) statements, 
and teacher B the lowest proportion of such 
statements. Teacher B uses the lowest propor- 
tion of reproving (category 6) remarks and of 
self-supportive (category 7) statements. 


A possible interpretation of B’s pattern of 
statements.--Teacher B would appear to be an 
individual who would offer verbal support and 
encouragement to the pupils (category 1) and 
would attempt, to a lesser extent, to convey to 
them her understanding of them and of their 
point of view. She possibly tries to keep the 
objective problem clearly defined and attempts 
to keep it the central object of attention and 
concern (category 3). She appears to utilize 
a small proportion of hortative statements 
(category 5). Her negative evaluations of 
pupil-behavior appear to be minimal (cate- 
gory 6). She appears to feel slight need for 
indulging in self-supportive and defensive 
comments. Here is a teacher, we might infer, 
whose methodology for facilitating learning is 
(1) to keep the learners well oriented to the 
objective problem, and (2) to maintain a help- 
ful and understanding attitude towards them. 
At the same time, she would seem to be able 
to keep her own needs well in the background. 





Application of Climate Index to a Higher 
Level of Instruction 








To test the applicability of the climate in- 
dex categories to a level of instruction beyond 
the secondary school level, typescripts derived 
from recorded sessions of a graduate seminar 
and from a counselor-training program were 
processed. 


Pattern of Statements Derived from a 
Categorization of Professor E’s Ver- 
bal Behavior 


The protocols containing Professor E’s ver- 
balizations are based on two 2-hour seminar 
meetings of graduate students. These were 
two consecutive sessions that were recorded 
and upon which the typescripts were based. 
(Table IX) 


Comments on Professor E’s pattern of 
statements.--A high degree of problem-orien- 
tation is evidenced in these two class situa- 
tions by the large proportion of problem- 
structuring comments (category 3). Positive 
evaluations (category 1) are minimal and nega- 
tive evaluations (category 6) are non-existent. 
Some verbal expression of acceptance of the 
students’ ideas and feelings is in evidence on 
the basis of the percentages falling into cate- 
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gory 2. The proportion of hortative statements 
(category 5) is not large. However, some evi- 
dence of a need to sustain himself and to be 
concerned about his status appears from the 
comparatively large (in relation to the rest of 
the pattern) proportion of category 7 statements 
in both sessions. 


Pattern of Statements Derived from Cate- 
gorization of Instructor F’s Verbal Be- 
havior 


The protocols of Instructor F’s sessions were 
based on three 1} hour sessions. They were 
random selections from a group of eight avail- 
able typescripts. The paucity of statements by 
the instructor, 150 in all for the entire three 
sessions, is both striking and interesting. 
(Table X) 


Comments regarding Instructor F’s pattern 
of statements.--instructor F uses a large pro- 
portion of clarifying and acceptant statements 
(category 2) in all three sessions. Encourag- 
ing and reassuring statements (category 1) are 
at a very minimum. Slightly less than 1/5 of 
his remarks are problem-structuring comments. 
Directive, reproving, and self-supportive state- 
ments are rarely used and are nearly all con- 
fined to session 1; the fact that the bulk of F’s 
self-concerned statements fell into session 1 
may be ‘‘explained’’ by the fact that it was the 
first meeting of the class. The whole pattern 
of F’s verbal behavior is explicable largely 
in terms of his adherence to the principles of 
client-centered psychotherapy. Counselors of 
that persuasion place great emphasis on the 
creation of permissive and acceptant atmos- 
phere by the use of clarifying and acceptant 
statements and other appropriate procedures. 
Instructor F clearly transferred his counsel- 
ing orientation into the classroom situation. 





It would appear from the analysis of both 
Professor E’s and Instructor F’s statements 
that the climate index categories are applicable 
to a higher level of instruction other than the 
secondary school level. 


Summary, Conclusions, and Implications 





Summary.--A technique has been developed 
for assessing the social-emotional climate in 
a classroom by categorizing teacher-statements 
contained in typescripts made from sound 
records of class sessions. The technique has 
been shown to have objectivity, reliability, and 
validity. 


Conclusions.-- 
1. Climate can be assessed and described. 
2. Several individuals can be trained to 











TABLE Xx 
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PERCENTAGES OF PROFESSOR E’S STATE- 
MENTS FALLING INTO EACH OF THE SEVEN 






































CATEGORIES 
Seminar 1 Seminar 2 
(N=102) (N=84) 
Category Percent Percent 
1 2.0 3.6 
2 4.9 8.3 
3 70.6 59.5 
Total 77.5 71.4 
4 8.8 11.9 
5 5.9 1.2 
6 0.0 0.0 
7 71.8 15.5 
Total 13.7 16.7 
Grand Total 100.0 100.0 
TABLE X 


PERCENTAGE OF INSTRUCTOR F’S STATEMENTS FALLING 


INTO EACH OF THE SEVEN CATEGORIES 


























Session 1 Session 2 Session 3 
(N=36) (N=62) (N=52) 

Category Percent Percent Percent 
1 0.0 8.0 0.0 
2 63.9 58.1 69.2 
3 22.2 21.0 15.4 
Total 86.1 87.1 84.6 
4 0.0 11.3 15.4 
5 2.7 0.0 0.0 
6 5.6 1.6 0.0 
7 5.6 0.0 0.0 
Total 13.9 1.6 0.0 
Grand Total | 100.0 100.0 100.0 
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. 


use the criteria of the climate index 
and achieve an adequate measure of 
agreement among one another in cate- 
gorizing statements in typescripts. 


3. A valid measure of social-emotional 
climate of groups is obtainable through 
a categorization of teacher-statements. 


4. Within the limits of behavioral and per- 
sonality variations of the climate index 
gives us a consistent pattern of verbal 
behavior for a given teacher from day 
to day. 


5. Different patterns of verbal behavior 
used by several teachers can be identi- 
fied. 


6. Statements categorized by the climate 
index as likely to produce ‘‘positive’’ 
feelings tend to be similarly categor- 
ized by impartial observers and tend 
to be reacted to with ‘‘positive’’ feel- 
ings by the individuals to whom they 
are addressed. 


7. Statements categorized according to 
the climate index as likely to produce 
‘‘negative’’ feelings tend to be similar- 
ly categorized by impartial observers 
and tend to be reacted to with ‘‘nega- 
tive’’ feelings by the individuals to 
whom they are addressed. 


Some revision of the climate index seems in 
order as a result of insights arising out of its 
application to several typescripts and to ‘‘live’’ 
classroom situations. These improvements 
should include: 


1. Clarification of category 1 in order to 
distinguish between objective and sub- 
jective positive evaluation of pupil be- 
havior; the former may be genuinely 
learner-supportive; the latter doesn’t 
appear to perform that function; clari- 
fication of category 5 
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2. More rigid definition of statements to 
be placed in category 4 which should 
contain perhaps only verbatim repeti- 
tions, polite formalities and adminis- 
trative comments 


3. Differentiation between types of problem- 
structuring statements 


4. Considering the possibility of sub-di- 
viding five of the seven categories 
(not 4 and 7) into ‘‘A’’ and “‘B”’ areas. 
The ‘“‘A’”’ area would represent little 
or no element of self-concern on the 
part of the teacher; the ‘‘B’’ area would 
represent a considerable element of 
self-concern and result in tingeing the 
statement with some affect. 


Implications for further research.--Some 
questions that arise out of this study are: 





1. What is the relationship between clim- 
ate and the quality of the learning that 
occurs in a classroom? 


2. To what extent is the climate in a given 
classroom a function of the personality 
of the teacher? 


3. To what extent do peer-group relation- 
ships influence the classroom climate? 


4. Of what value is the climate index to 
teachers in analyzing their own teach- 
ing methods ? 


5. How applicable is the climate index to 
the ‘‘live’’ classroom situation? 


The study described above represents an 
attempt to develop a technique for the measure- 
ment of social-emotional climate to the end 
that, ultimately, fuller understanding and con- 
trol may be achieved of one of the factors hy- 
pothesized to influence learning. 
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AN EXPERIMENTAL STUDY OF THE EFFECTIVE- 
NESS OF FILMS AND FILMSTRIPS 


IN TEACHING 


GEOMETRY“ 


DONOVAN A. JOHNSON 
University of Minnesota 


The Problem 


The general objective of the experiment was 
to investigate the claim that instruction in geom- 
etry which emphasizes visualization and appli- 
cation by the use of sound motion pictures or 
filmstrips is more efficacious in promoting 
learning than instruction which uses other sup- 
plementary aids. Specifically, this study evalu- 
ated experimentally the relative effectiveness of 
certain audio-visual aids as contrasted with con- 
ventional instruction on (1) informational learn- 
ing, (2) development of fundamental skills, 
(3) ability to make applications of geometric 
principles and facts, and (4) retention of these 
types of learning. Wherever the audio-visual 
aids were used in experimental classes, they 
replaced an equivalent amount, in time, of con- 
ventional instruction. 


The null hypothesis is: There is no difference 
between experimental and control classes in 
informational learning, problem solving skills, 
applicational learning, and retention of learn- 
ing resulting from the use of certain audio-vis- 
ual aids in classroom instruction. 


The Design of the Experiment 





The three requirements for a scientifically 
designed experiment from which clear cut con- 
clusions may be drawn are: (1) randomization, 
(2) replication, and (3) local control. 


In the present investigation, the selection of 
the sample as well as the selection of the treat- 
ment was made in a random manner. By the 
use of Tippett’s random numbers, twelve 
schools of all the schools within a radius of 
75 miles of Minneapolis and having at least two 
sections of plane geometry were selected to 
participate in the experiment. The school en- 
rollments of the participating schools range 
from 230 to 1500 and the enrollment in partic- 
ipating geometry classes ranged from 14 to 
35. Similarly the selection of the type of vis- 
ual aid to be used was made in a random man- 
ner. It is randomization that makes possible 
an unbiased estimate of the effect and gives 





validity to statistical tests of significance. 


The requirement of replication is satisfied in 
this experiment in that the investigation con- 
sisted of fifteen separate and independent ex- 
periments with a total of twenty-seven classes-~ 
participating. Replication provides the only 
means of providing the experiment with its 
own appropriate estimate of error and of dim- 
inishing experimental error since the error 
is inversely proportional to the square root of 
N where N equals the number of replicates. 

The number of cooperating schools in this ex- 
periment furnishes a broad base for drawing 
conclusions and guards against accepting gen- 
eralizations uncritically which often results 
from the single isolated experiment. I will 

be noted in the results of the statistical analysis 
that the outcomes are not the same in all 
schools. The selection of a few specific schools 
for this experiment would have yielded results 
that would have been unwarranted by the findings 
of the comprehensive investigation. If exper- 
iments are to contribute to educational knowl- 
edge, significant educational problems should 
be attacked on a broad base. 


Each of the fifteen experimental units was a 
self-contained experiment since each had its 
own experimental group and control group. 
The evidence was determined for each exper- 
iment and collated in certain cases. These 
local control groups eliminate the errors re- 
sulting from the use of experimental and con- 
trol groups each in different localities or 
under different teachers. Registration methods 
in each of the participating schools provided 
for random, heterogeneous grouping of pupils 
and thus prevented bias from entering into 
the composition of any of the classes. By a 
random selection one of the classes in each 
school was designated as the control class 
and others as experimental classes. The ex- 
perimental classes in each school specify the 
groups which used certain visual aids as a 
part of their regular instruction. The control 
class specifies the group taught by the same 
teacher in the same school but for which no 
films or filmstrips were provided. The con- 
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ipating geometry classes ranged from 14 to 
35. Similarly the selection of the type of vis- 
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ner. It is randomization that makes possible 
an unbiased estimate of the effect and gives 





validity to statistical tests of significance. 


The requirement of replication is satisfied in 
this experiment in that the investigation con- 
sisted of fifteen separate and independent ex- 
periments with a total of twenty-seven classes- 
participating. Replication provides the only 
means of providing the experiment with its 
own appropriate estimate of error and of dim- 
inishing experimental error since the error 
is inversely proportional to the square root of 
N where N equals the number of replicates. 

The number of cooperating schools in this ex- 
periment furnishes a broad base for drawing 
conclusions and guards against accepting gen- 
eralizations uncritically which often results 
from the single isolated experiment. It will 

be noted in the results of the statistical analysis 
that the outcomes are not the same in all 
schools. The selection of a few specific schools 
for this experiment would have yielded results 
that would have been unwarranted by the findings 
of the comprehensive investigation. If exper- 
iments are to contribute to educational knowl- 
edge, significant educational problems should 
be attacked on a broad base. 


Each of the fifteen experimental units was a 
self-contained experiment since each had its 
own experimental group and control group. 
The evidence was determined for each exper- 
iment and collated in certain cases. These 
local control groups eliminate the errors re- 
sulting from the use of experimental and con- 
trol groups each in different localities or 
under different teachers. Registration methods 
in each of the participating schools provided 
for random, heterogeneous grouping of pupils 
and thus prevented bias from entering into 
the composition of any of the classes. By a 
random selection one of the classes in each 
school was designated as the control class 
and others as experimental classes. The ex- 
perimental classes in each school specify the 
groups which used certain visual aids as a 
part of their regular instruction. The control 
class specifies the group taught by the same 
teacher in the same school but for which no 
films or filmstrips were provided. The con- 
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trol classes were given conventional instruc- 
tion for an equivalent period of time which the 
experimental classes spent viewing films or 
filmstrips. Except for the visual materials 
used in the experimental classes, the experi- 
mental and control classes in each school had 
the same class time, activities, materials and 
tests. Each teacher was given extensive in- 
struction, orally and in writing, as well as 
supervision in order to assure uniformity in 
the use of the visual materials in the classes. 
A log sheet for each lesson was completed by 
each teacher so that differences in instruction 
could be taken into consideration in interpre- 
tations. 


The instructional units used in the plane 
geometry experiment were chosen to take ad- 
vantage of the potentialities of visual aids and 
at the same time be representative of mathe- 
matics subject matter. The instructional units 
in plane geometry were Circles and Loci. With 
the circle so closely related to the wheel and 
locus defined as the path of a point moving 
according to given conditions, both of these 
units contain material in which motion is a 
factor. 





In preparing the cooperating teachers for the 
instruction of these units, a personal conference 
was held with each, the procedure to follow was 
outlined in written form, and the teacher was 
visited during the course of the experiment. In 
most schools this personal visit included visit- 
ing the geometry classes. 


The procedure followed in the teaching of each 
experimental unit was outlined as follows: (1) 
Prior to the beginning of an experimental unit, 

a pretest was administered to the participating 
classes; (2) films or filmstrips were used in the 


experimental classes at the time and in the man- 


ner prescribed in the visual aids teaching guide; 
(3) control classes participated in worthwhile 
learning activities as a substitute for the visual 
aids being used by the experimental classes; 

(4) experimental and control classes spent the 
same amount of time learning the experimental 
units; (5) excluding the films and filmstrips, the 
control and experimental classes had the same 
instruction, the same assignments, the same 
supplementary materials or projects, the same 
activities and the same tests; (6) each partici- 
pating instructor was free to use any method, 
materials, assignments, activities, amount of 
time, or tests she desired for the experimental 
classes; (7) at the end of the unit, all classes 
were given an objective achievement test; (8) 
approximately two months after the completion 





of an experimental unit, the achievement test 
was repeated as a retention test. It is to be 
noted that the the units were not outlined and 
prescribed with respect to such items as objec- 
tives, subject matter content, time schedule, 
materials of instruction, or student activities. 


Otis Mental Ability Tests were administered 
to all students who participated in the study and 
the results of these tests were used as one of 
the independent matching variables. Achieve- 
ment in the two instructional units in plane geo- 
metry was measured by achievement tests which 
were constructed by the writer. These achieve- 
ment tests were used at the beginning of the ex- 
periments as pre-tests and again at the end of 
the experiments as post-tests. The achievement 
tests were also used as retention tests in some 
of the experiments. The tests were mailed to 
reach the school just prior to the time of admin- 
istration and were returned to the writer by ex- 
press as soon as possible after they had been 
administered. The tests were scored and results 
sent to the teachers within a few days. 





The evaluation was concerned with two types 
of visual aids, namely, sound films and film- 
strips. The films! used were Circles, Angles 
and Arcs of Circles, Chords and Tangents of 
Circles, and Locus. The filmstrips¢ used were 
Postulates-Triangles and Circles; Vocabulary- 
Circle I, Vocabulary-Circle I, and Locus. 
Learning guides were constructed so that the 
method of using films and filmstrips could be 
held as uniform as possible. These guides pre- 
scribed the time and method of using the teach- 
ing aids including discussion questions and a 
short test. This was done to ensure the use of 
the films in the manner found most effective by 
previous research. 

















In order to compare the effectiveness of films 
and filmstrips, the plane geometry classes were 
randomly assigned visual aids as follows: (1) 
Five experimental classes used only filmstrips, 
(2) seven experimental classes used only sound 
films, and (3) three experimental classes used 
filmstrips and sound films. Three schools each 
had three geometry classes and, thus, used film- 
strips in one class, sound films in another, while 
the third class was the control. 


The Statistical Analyses 


The students who participated in the experi- 
ments were given mental ability tests, achieve- 
ment pre-tests, and achievement post-tests, 
and where possible achievement retention tests. 
The scores from these tests were used as the 
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basis for the statistical analyses. 


In order to illustrate the specific statistical 
techniques used in this study, data from two of 
the experimental units will be presented and 
analysed. The statistical techniques which are 
widely known by research workers will not be 
elaborated. The analysis of variance and co- 
variance and the Johnson-Neyman technique are 
less familiar and thus will be discussed in some 
detail. The data selected for analysis will in- 
volve the data from two schools in which the ex- 
perimental classes used three films and three 
filmstrips as a supplement of the regular in- 
struction. The control classes used no films or 
filmstrips. 


The Johnson-Neyman Technique.--The signif- 
icance of the difference between means was 
computed for each independent experiment and 
is presented in Table I. 





The probability of the F value was required 
to be greater than .01 before applying the fol- 
lowing formula: 


t= ¥,-%, 


Ext +2x, \/1 +L 
N, +N, - x, N 


It will be noted that both of the t values are 
significant at the one percent level. This 
shows that the differences in means are sta- 
tistically significant and in favor of the exper- 
imental groups. 





Before pooling the data, the homogeneity of 
the variances was tested separately for the 
experimental and control groups. Since there 
were only two schools involved, the homogen- 
eity of variances was tested by Snedecor’s 
F ratio. The significance of the difference be- 
tween means of experimental and control 
classes was tested separately by analysis of 
variance. The obtained F values were less 
than the tabled values at the one percent level 
of significance. Since both the experimental 
and control groups did not differ in variability 
and the means of the respective groups did not 
differ significantly, the data were pooled for 
each of the groups, respectively, before apply- 
ing the Johnson-Neyman technique. This tech- 
nique was used to compare the pooled experi- 
mental classes with the pooled control classes 
on Part I Circle Retention-Test. This tech- 
nique made it possible, (1) to test the signifi- 
cance of the difference between the adjusted 
means of the experimental group and the con- 
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trol group as based on results obtained on 
Part I Circle Retention-Test, and (2) to set 

up a region of significance if a significant dif- 
ference was found to exist between the experi- 
mental and control groups. 


The hypothesis H tested in this case was the 
null hypothesis. It stated that there was not a 
significant difference between the means of the 
experimental group and the control group in 
the retention of the informational learning as 
measured by Part I Circle Retention-Test when 
the retention-test scores were adjusted on the 
basis of the Otis Quick-Scoring Mental Ability 
Test and the Part I Circle Achievement Pre- 
Test scores. 


The basic values required are as follows for 
Schools F and K: 


X = Mental Test Scores 

Y = Pre-test Scores of Part I Circle 
Achievement Test 

Z = Retention-Test Scores of Film and 
Filmstrip Group on Part I Circle 
Achievement Test 

U = Retention-Test Scores of Control 
Group 


Experimental Group Control Group 


N, =51 No = 44 
X, = 54.78 Xq = 52.40 
Y, = 9.45 Yo = 9.04 
Z, = 23.45 U = 17.04 
Sx,* 9.81 s Xo * 8.48 
s v1" 4.51 s y2* 4.49 
S, = 6.42 Sy = 7.88 
= .48 = 
xiY) x» Wp -46 
x42 61 Txpu = .30 
Tytz = 66 Tywu = -45 


The ratio between the relative and absolute 
minimum sums of squares provides a rigorous 
test of significance of the differences between 
means. In our case this ratio which is desig- 
nated as w was found to be 3.934. By interpol- 
ation from Table V in Johnson and Neyman’s 
publication’, we obtain W 9; = 12.850. Thus, 





3. Palmer 0. Johnson and J. Neyman, "Tests of Certain Linear Hypotheses and 
Their Application to Some Educational Problems," Statistical Research Memoirs, 


I (1936), pp. 57-93. 
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TABLE I 


RETENTION-TEST RESULTS WITH FILMS AND FILMSTRIPS--TESTS OF SIGNIFICANCE 
OF DIFFERENCES BETWEEN MEANS OF EXPERIMENTAL AND CONTROL CLASSES 
FOR PART I KNOWLEDGE OF FACTS CIRCLE TEST 






































F Values at 

School] Class| N Mean | S.D. F 05 1 t | Probability 
F Exp. | 27 25.0 | 6.56 "i 

Con. | 23 19.8 | 6.61 | 1.01 | 1.96 2.63 2.76 01 
K Exp. | 24 21.7 | 5.92 

Con. | 21 14.0 | 8.17 | 1.90 | 2.04 2.78 3.65 01 
Total |Exp. | 51 23.4 | 6.42 

Con. | 44 17.0 | 7.88 | 1.50 | 1.63 2.00 

Total | 95 20.4 | 7.79 

TABLE 1 


TEST OF HOMOGENEITY OF REGRESSION 




















df. Sum of Squares | Variance F 
Deviations from Average 
Regression Within 91 2394.1784 26.3097 
Deviations from Individual 
Regressions 90 2362.5955 26.2511 1.20 
Deviations Among Group 
Regressions | 1 31.5829 31.5829 
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since the observed value of W is less than the 
interpolated table value of W 01, we conclude 
that the experimental and control groups do 
differ significantly at the one percent level on 
Part I Circle Retention-Test. In other words, 
the audio-visual aids group was on the average 
significantly superior to the non-audio-visual 
aids group in informational learning as meas- 
ured by Part I Circle Retention-Test, where 
differences in pre-test and mental test scores 
were controlled. 


The region of significance which is repre- 
sented by an ellipse is shown in Figure 1. To 
the right of the line of non-significance, the 
value of F(x',y’) is favorable to the experimental 
groups. The value is significant at the one per- 
cent level for that part of the population en- 
closed within the ellipse. That is for all values 
of (x’, y’), the basic characters of matching-- 
mental ability and initial information--within 
the region of significance, the hypothesis H, 
(x’, y’) is rejected. To the left of the line of 
non-significance the value of F(x’, y’) is fav- 
orable to the control group but nowhere sig- 
nificantly so. 


The Analysis of Variance and Covariance.-- 





The analysis of variance and covariance makes 
it possible to partial out the effects of an in- 
itial lack of equivalence of the classes. It 
makes it possible to conduct an experiment 

with intact classes without matched pairs being 
involved. In this experiment mental test scores 
and pre-test scores were obtained so that it 
was possible to hold constant the effects of 
mental ability and initial information. 


The analysis of variance and covariance will 
be illustrated for the data from these same 
schools for the retention of applicational learn- 
ing. Similar to the data presented above, the 
differences between means in the retention of 
applicational learning were statistically sig- 
nificant in favor of the experimental classes. 
Since the requirements for pooling the data 
were satisfied, the analysis is completed for 
the data of the experimental and control class- 
es pooled separately. 


Before an analysis of variance and covariance 
can be completed, certain basic assumptions 
must be satisfied. One of these is the assump- 
tion of homogeneity of variance within classes. 
This homogeneity of variance can be tested by 
the Welch Nayer Lj test if more than two class- 
es are involved and by the F test if only two 
classes are being analysed. For the data of this 
analysis, the L; value, .958, exceeds the tabled 
value at the 5 percent level of significance and, 
this, the condition of homogeneity of variance 
wus satisfied. 
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Another basic assumption underlying the anal- 
ysis of covariance is that the regression co- 
efficients within groups are homogeneous. For 
the data at hand, the analysis of variance was 
used to test the homogeneity of the regression 
coefficients. As shown in Table Il, the F ratio 
of 1.20 was not significant and, thus, the con- 
dition of homogeneity of regression within 
groups was satisfied. This assumption may 
also be tested by the Welch-Nayer L, test. 


The analysis of variance and covariance sum- 
marized in Table ITI indicates that the difference 
between the mean of the pooled experimental 
group, and the mean of the pooled control groups 
is significant at the 1 percent level when the 
factors of mental ability and initial information 
are held constant. The difference is in favor of 
the experimental or audio-visual groups. Hence 
we can reject the hypothesis that there is no 
difference between the means of the experi- 
mental and control groups in the retention of 
applicational learning resulting from the use of 
the films and filmstrips as a complement of 
conventional instruction. 


Summary 


Since the scope of this experiment precludes 
the possibility of presenting all the data and 
analyses, the results are summarized in Tables 
IV and V. It will be noted in these tables that 
very few differences between the means of ex- 
perimental and control groups were statistically 
significant. There were differences in favor 
of experimental groups as well as control 
groups. These results point out how readily 
an incorrect conclusion might be drawn if a 
single experiment were conducted. Thus, it 
would seem important that cooperative experi- 
mentation is essential to furnish a broad base 
for conclusions and guard against accepting 
generalizations uncritically. 


The one area in which the results were con- 
sistently in favor of the experimental groups 
was in the retention of learning in those classes 
using three films and three filmstrips in the 
circle unit. These results have added signifi- 
cance when it is noted that these experimental 
classes were not significantly different on the 
post-test means. 


Implications 


In conclusion we wish to summarize by indi- 
cating certain implications which appear war- 
ranted as a result of this study. It should be 
noted that the study was, (1) designed to meas- 
ure the relative effectiveness of certain audio- 
visual materials in terms of unit objectives; 
(2) conducted under as nearly normal class- 
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TABLE Il 


ANALYSIS OF VARIANCE AND COVARIANCE OF PART IV CIRCLE RETENTION-TEST 
SCORES FOR SCHOOLS A AND K WITH MENTAL TEST SCORES AND PRE-TEST 
SCORES HELD CONSTANT 


A. Data for Analysis of Variance and Covariance 
x=Mental Test y=Pre-Test z=Retention-Test 





N x2 xy y2 yz z2 XZ 





Exp. 51 4820.6275 | 694.0980 | 1537.9216 | 611.4706 | 1857.1765 1194.4118 
Con. 44 3096.6363 | 299.5909 602.9773 85.7955 | 1039.1591 561.3182 
(7917.2638) | (993.6889) |(2140.8989) | (697.2661) | (2896.3356) |(1755.7300) 




















Total 95 8050.5263 |1069.4211 | 2183.9368 | 842.6632 | 3387.5368 | 2011.5789 





B. Adjustment Table for Analysis of Covariance 








Denominator by bo by xz bo yz 22 df. 
Exp. 6931975.124 | .203765 | .305632 | 243.37932 | 186.88498 | 1426.91220 | 48 
Con. 1777446.688 | .175959 | .054861 | 98.76899 | 4.70683 | 935.68328 | 41 


(2362.59548) 


15962643.73 -192072 | .236539 | 337.22657 | 164.93063 | 2394.17840 




















Total 16438179.16 -212432 | .281823 | 427.32373 | 237.48187 | 2722.73120 89 








C. Adjusted or Reduced 








Source of Sum of Mean 

Variation d.f. Squares Square F 

Between Groups 1 328.5528 328.5528 

Within Groups 91 2394.1784 26.3097 12.48 
Total 92 2722.7312 
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Y = CIRCLES PRE-TEST SCORE 
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Figure 1 


Comparison of the experimental students (x) and the control students (0) in 
achievement on Part I, Knowledge of Facts, Circles Retention-Test, when 
matching was affected on the Otis Quick-Scoring Mental Ability Test and 
Part I Circles Pre-Test. 
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TABLE IV 


SUMMARY OF THE DIFFERENCES BETWEEN EXPERIMENTAL AND CONTROL CLASSES 
ON POST-TESTS 


















































Circle Unit Experimental treatment 

3 Films 

3 Filmstrips | Film and Filmstrip Films Filmstrips 

Exper. (Control Exper. /Control Exper. (Control 
Type of No Sig. | Groups |Groups |No Sig.| Groups | Groups| No Sig.| Groups | Groups 
Learning Differ. |Superior Superior} Differ.|Superior| Super. |Differ. |Superior|Superior 

Informational 3 0 0 6 1 0 5 0 0 

Problem 

Solving 3 0 0 7 0 0 5 0 0 

Applica- 

tional 2 1 0 5 2 0 + 0 1 
Total 8 1 0 18 3 0 14 0 1 

Locus Unit 

1 Film 

1 Filmstrip 

Informational 3 0 0 7 0 0 5 0 0 

Problem 

Solving 2 1 0 6 0 1 5 0 0 

Applica- 

tional 3 0 0 6 0 1 5 0 0 
Total 8 1 0 10 0 2 15 0 0 









































(M: 














(March 1949) JOURNAL OF EXPERIMENTAL EDUCATION 371 


TABLE V 


SUMMARY OF THE DIFFERENCES BETWEEN EXPERIMENTAL AND CONTROL CLASSES 
ON RETENTION-TESTS 





















































Circle Unit Experimental treatment 

3 Films 

3 Filmstrips Film and Filmstrip Films Filmstrips 

Exper. |Con. Exper.| Con. Exper. | Con. 
Type of No Sig.|Group |Group_ | No Sig.|Group | Group | No Sig.|Group | Group 
Learning Differ. | Superior] Superior Differ.|Super.| Super. | Differ. | Super. | Super. 

Informational 0 2 0 3 0 0 1 1 0 

Problem 

Solving 0 2 0 3 0 0 2 0 0 

Applica- 

tional 0 2 0 3 0 0 1 0 1 
Total 0 6 0 0 0 0 + 1 1 

Locus Unit 

1 Film 

1 Filmstrip 

Informational 2 0 0 2 0 1 2 0 0 

Problem 

Solving 2 0 0 3 0 0 2 0 0 

Applica- 

tional 2 0 0 2 0 1 2 0 0 
Total 6 0 0 7 0 2 6 0 0 
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room situations as possible; (3) conducted at 
two high school levels; (4) made with three 
types of audio-visual aids; and (5) made in 
schools which were selected in a random man- 
ner. It should be further noted that this study 
attempted to evaluate the relative effective- 
ness of audio-visual material as they affected 
informational learning, the learning of problem 
solving skills, applicational learning and the 
retention of learning in these areas. The im- 
plications are as follows: 


1. It appears that audio-visual aids which 
are developed for use in mathematics classes 
might be more effective as aids to learning if 
they were designed to supplement rather than 
repeat the type of instruction which the students 
have in the typical mathematics classroom. The 
audio-visual aids used in this study contained 
material which was frequently similar to ma- 
terial presented in mathematics textbooks. 
Some of the material presented by the films 
used animated drawings that were similar to 
the type of illustrations which mathematics 
teachers customarily draw on the blackboard. 
Experimental results might be considerably 
more favorable to the effectiveness of visual 
materials if the audio-visual aids would bring 
to the classroom unique experiences that are 
vital and functional to the adolescent student, 
that is, different from experiences supplied 
by textbook or teacher or by other non-visual 
aids. For example, a motion picture film 
could show the geometry used in aviation or a 
filmstrip could illustrate the geometric designs 
used by nature in snowflakes, bee cells, flowers, 
seeds or crystals. 





JOURNAL OF EXPERIMENTAL EDUCATION (Vol. xvi) 


2. An implication from this study would in- 
dicate that if we are to secure audio-visual 
aids that actually enhance learning, careful ex- 
perimental studies should be made testing the 
claims made for them before they are made 
available for instructional purposes. The num- 
ber of experiments which have resulted in sig- 
nificant differences in means have been rela- 
tively few. And even when statistically signif- 
icant results were obtained, one might raise 
the question as to whether the results are ped- 
agogically significant. Additional experimenta- 
tion is also needed to test the effectiveness of 
other types of materials of instruction, such as 
charts, opaque projections, and stereographs. 


3. On the basis of the cooperation obtained 
from the administrators and teachers of partic- 
ipating schools, it appears that experimentation 
in randomly selected schools is entirely feas- 
ible. The teachers who participated exhibited 
considerable interest in the experiment and 
cooperated in following directions and sugges- 
tions in a highly satisfactory manner. However, 
most of the participating teachers lacked train- 
ing and experience in methods of research and 
in methods of using visual materials. Further- 
more, the time available to teachers in view of 
the usually heavy teaching load limits the extent 
to which they can prepare adequate lessons us- 
ing available materials. There is also a need 
for the coordination of means of informing 
teachers of the materials that are available. 
Many of the teachers participating in this 
study had never used or seen a mathematics 
film or filmstrip before. 














AN EXPERIMENTAL EVALUATION OF THE RELATIVE EFFECTIVE- 
NESS OF CERTAIN AUDIO-VISUAL AIDS IN 
VOCATIONAL AGRICULTURE* 


RALPH R. BENTLEY 
Purdue University 


The purpose of this experimental study was, 
(1) to determine the relative effectiveness of 
certain audio-visual aids on informational 
learning and on the ability to solve problems or 
make application of principles in vocational ag- 
riculture, and (2) to determine the relative ef- 
fectiveness of certain audio-visual aids on the 
retention of these two types of learning in vo- 
cational agriculture. 


The Design of the Experiment 





The selection of the cooperating schools.--The 
schools which cooperated in this study were 
selected with the aid of the Agricultural Educa- 
tion staff of the University of Minnesota and the 
Minnesota State Department of Agricultural Ed- 
ucation. The cooperating schools were located 
in the southern half of Minnesota and were 
selected on the basis of the following criteria: 
(1) the agricultural area in which the school was 
located; (2) the ability and success of the local 
teacher of agriculture and his interest and will- 
ingness to cooperate in the proposed experi- 
mental study; (3) the number of students en- 
rolled in the agriculture classes of each school; 
and (4) the cooperation which might be expected 
from the administrators in each school. 





The selection of the experimental projects.-- 
The instructional units which served as the 
basis for the three experimental projects were 
in home gardening, swine production, and pas- 
ture production. The units were selected be- 
cause (1) they were generally taught to classes 
in vocational agriculture; (2) suitable audio- 
visual aids were available for these units; (3) it 
was possible to measure the effectiveness of 
audio-visual aids in both crop and livestock 
enterprises; and (4) these units were usually 
taught to students of different ages. 





The selection of the audio-visual aids.--The 
audio-visual aids used in this study were se- 
lected by the writer who was assisted by audio- 
visual education specialists of Iowa State College 
and of the University of Minnesota. The follow- 
ing factors were taken into consideration in 








selecting the audio-visual aids: (1) the educa- 
tional level for which they were suited; (2) the 
subject-matter content; (3) the photography 
and technical construction, {4) the color of the 
pictures; and (5) the physical condition of the 
films and slides. 


The independent experiments.-- The investiga- 
tion was made in eight different schools and 
consisted of twelve independent experiments. 
Each of the twelve independent experiments had 
its own experimental and control groups. Six of 
the experiments were in the Home Garden Pro- 
ject, four in the Swine Production Project and 
two in the Pasture Production Project. Altho- 
ugh the independent experiments, in each of the 
three experimental projects were complete ex- 
periments within themselves there were certain 
features which were common to the independent 
experiments in each experimental project. The 
common features were, (1) all students ina 
given experimental project were taught the 
same instructional unit; (2) all students ina 
given experimental project were administered 
the same tests with the exception of the reten- 
tion test which was not administered in certain 
independent experiments; (3) the experimental 
groups in each of the experimental projects had 
the same audio-visual aids as part of their in- 
struction; and (4) the instructional time, in 
each of the independent experiments, was equal 
for the experimental and control groups. 





The experimental and control groups.--In 
each independent experiment there was an ex- 
perimental or audio-visual aids group anda 
control or non-audio-visual aids group. Where 
two sections of the same agriculture class were 
taught in the same school, one section was used 
as the experimental group and the other as the 
control group. The treatment in each case was 
determined by lot. Most of the experiments, 
however, were conducted in schools where 
there was only one class section. In such 
instances the experimental and control groups 
were obtained by dividing the classes at random. 
The instruction of both groups, in each of the 








* A summary of a thesis submitted to the faculty of the Graduate School of 
the University of Minnesota in partial fulfillment of the requirements for 


the degress Doctor of Philosophy, 1947. 


373 








374 JOURNAL OF 


independent experiments, where divided class- 
es were used. was under the direct supervision 
of the same instructor and the groups were 
taught as a unit except when the audio-visual 
aids were shown. 


The testing program.--The Terman-McNemar 


Test of Mental Ability was administered to all 
students who participated in this experimental 
study and the results were used as one of the 
independent matching variables. Achievement 
in the three experimental projects, Home Gar- 
dening. Swine Production, and Fasture Fro- 
duction, was measured by achievement tests 
which were constructed by the writer and were 
designed to measure in terms of the objectives 
of each respective unit. These tests were 
used before the instructional period as pre- 
tests, at the end of the instructional period as 
post-tests in all experimental projects and as 
retention tests in certain of the Home Garden 
and Swine Production experiments. 


The Statistical Analysis 


The statistical techniques used in this study 
are as follows: (1) the arithmetic mean was 
used as a measure of central tendency and the 
standard deviation and variance were used as 
measures of variability. (2) When it was 
necessary to test the homogeneity of data for 
two groups, the t-test was used to determine 
the significance of the difference between means , 
and the F-test was used to determine the signif- 
icance of the difference between variances. 

(3) When it was necessary to test the homogen- 
eity of data for more than two groups, the anal- 
ysis of variance was used to determine the sig- 
nificance between means and the Welch-Nayer 
(Ly test) was used to determine the significance 
of the difference between variances. (4) The 
data for the independent experimental groups in 
each experimental project were pooled whenever 
it was shown that their means did not differ sig- 
nificantly and that their variances were homo- 
geneous. The control groups were treated 
similarly. (5) The analysis of variance and 
covariance was used to test the significance of 
the difference between the means of the exper- 
imental and control groups on the retention 

test. (6) The Johnson-Neyman technique! was 
used to compare the pooled experimental groups 
and the pooled control groups in the Home Gar- 
den and the Swine Froduction experimental pro- 
jects. This technique makes possible the com- 
parison of the means of the two groups after 
their adjustment for any differences in the basic 
characters of matching which in this experiment- 
al study were the Terman-McNemar Test of 





Mental Ability scores and the achievement pre- 
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test scores. If there proves to be a non-sig- 
nificant difference between the means, the 
analysis is not carried further. However, if 
there is a significant difference between the 
means after adjustment then the analysis is 
continued to set up a region of significance. The 
region of significance enables one to determine 
all the systems of matching variables for which 
a difference would be found between the experi- 
mental and control groups. That is, it deter- 
mines for what kind of students as defined by 
the matching variables a significant difference 
would be found in favor of one or the other of 
the experimental and control groups. 


The Home Garden Experimental Groups 


The Home Garden Experimental Project con- 
sisted of six independent experiments each of 
which had its own experimental and control 
groups that were taught by the same instructor, 
used the same reference materials, spent an 
equal amount of time on the home garden unit 
and were administered the same tests at the 
same time. The instruction of the two groups 
in each experiment was identical except that the 
control group studied and discussed the home 
garden unit for a time equal to that which was 
used in presenting the movie to the experimental 
group. The movie entitled ‘‘The Farm Garden”’ 
and the film strips ‘‘Gardening for Victory"’ 
Parts I and II were the audio-visual aids used in 
the Home Garden Project. Achievement was 
measured in this experimental project by the 
Home Garden Achievement Test of which Part 
I was designed to measure informational learn- 
ing and Part II applicational learning. On the 
basis of the separate post-test results for Fart 
I and Part II the Data of each part were pooled 
for the six independent experimental groups 
when it was shown that their variances were 
homogeneous and that the differences among 
their means were not significant. This was 
likewise true of the variances and means of the 
six independent control groups. 


The Application of the Johnson-Neyman Tech- 
nique.--The Johnson-Neyman technique was used 
on the pooled data (1) to test the significance of 
the difference between the adjusted means of 
the audio-visual aids or experimental group 
and the non-audio-visual aids or control group 
as based on results obtained on Part I of ihe 
Home Garden Achievement F ost-test, and (2) to 
set up a region of significance if a significant 
difference was found to exist between the exper- 
imental and control groups. Table I contains 
the basic statistics which were used in making 
application of the Johnson-Neyman technique to 
the results of Part I of the Home Garden Achiev- 








1. Palmer 0. Johnson and J. Neyman, “Tests of Certain Linear Hypotheses and 
Their Application to Some Educational Problems," Statistical Research 
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TABLE I 


STATISTICS COMPUTED FOR THE APPLICATION OF THE JOHNSON-NEYMAN 
METHOD OF ANALYSIS OF HYPOTHESIS Hy 





Calculated Data 


1. Notations 
Dependent Variable 
Independent Variables 


Z,: Part I Home Garden Achievement Post-test 
X,: Part I Home Garden Achievement Pre-Test 
Ys : Terman-McNemar Test of Mental Ability 
Frequencies Ns 

where s = 1, 2 denote Experimental and Control Data, respectively 








2. Results 
N,= 73 Ng = 73 
EZ, = 2145 £ Zo = 2038 
> z: = 65235 is z3 = 59592 
xX X, = 1739 XL Xe = 1710 
LC xi = 44001 c x5 = 42584 
LY; = 6527 X Yo = 6599 
c Y; = 619977 c Y> = 637364 


LA X, = 52415 
CZ Y, = 197076 


161021 


CY; 


Mean % = 29.38 
S.D. % = 5.54 
Mean X = 23.82 
S.D. X,= 5.84 
Mean Y; = 89.41 


S.D. Yj, = 12.02 


I % Xp = 49346 
X BH Yo = 189350 
I X% Yo = 159117 


Mean Z = 27.92 
S.D. % = 6.12 
Mean Xo = 23.42 
S.D. X_= 5.93 
Mean Yo = 90.40 


S.D. Yo = 12.95 
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ement Post-test. 


The hypothesis Hj, tested in this case was the 
null hypothesis. It states that there is not a 
significant difference between the means of the 
experimental or audio-visual aids group and the 
control or non-audio-visual aids group in infor- 
mational learning as measured by Fart I of the 
Home Garden Achievement F ost-test, when the 
post-test scores are adjusted on the basis of 
the Terman-McNemar Test of Mental Ability 
and Part I of the Home Garden Achievement 
Pre-test scores. 


The ratio between the relative and absolute 
minimum sums of squares provides a rigorous 
test of significance of the differences between 
means. In our case, this ratio which is desig- 
nated as u was found to be .8134. Py interpol- 
ation from Table V in Johnson and Neyman’s 
article“ we obtain u 9; = .9112. Thus, since 
our observed value of u is less than the inter- 
polated value of u_g;, we conclude that the ex- 
perimental and control groups do differ signif- 
icantly at the 1 percent level on Fart I of the 
Home Garden Achievement fF ost-test. In other 
words, the audio-visual aids group was on the 
average significantly superior to the non-audio- 
visual aids group in informational learning as 
measured by Fart I of the Home Garden Achieve- 
ment Fost-test. 


The region of significance which is repre- 
sented by a hyperbola is shown in Figure 1. 
Since there are two branches of the hyperbola 
we have the interesting situation whereby for 
every system of values X‘Y‘ in the left branch 
of the hyperbola the H;(X'Y') is rejected in 
favor of the experimental group, and for every 
system of the values X'Y‘ in the right branch 
of the hyperbola the H;(X'Y') will be rejected 
in favor of the control group. Study of the 
region of significance indicates that the experi- 
mental group may be said to be superior to the 
control group throughout the range of mental 
ability of the students and for the average or 
below average student as measured by Part I 
of the pre-test. 


The Johnson-Neyman technique was also used 
to test the significance of the difference between 
the adjusted means of the experimental and 
control groups as based on the results obtained 
from Part II of the Home Garden Achievement 
Post-test. It will be recalled that Part IT was 
designed to test applicational learning. Table 
Tl gives the basic statistics which were used in 
applying the Johnson-Neyman technique to the 
results of Part II of the Home Garden Achieve- 
ment F ost-test. 
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The ratio between the relative and absolute 
minimum sums of squares which has been des- 
ignated as u, is .9683. By interpolation from 
Table V in Johnson-Neyman’s article” we obtain 
u_g1 = -9112. Since our observed value of u is 
greater than the interpolated table value for 
u_g, we conclude that the adjusted means of the 
experimental and control groups do not differ 
significantly at the 1 percent level of signifi- 
cance. Thus, the audio-visual aids group and 
the non-audio-visual aids group were equal in 
applicational learning or in their ability to 
solve problems as measured by Part TI of the 
Home Garden Achievement Post-test. 


The Home Garden Achievement Retention- 
test was administered in two of the independent 
experiments, and the results were analyzed by 
the analysis of variance and covariance with 
the post-test results held constant. The differ- 
ence between the means of the experimental 
and control groups for Fart I of the Home Gar- 
den Achievement Retention-test was not signif- 
icant. This was likewise true of the experi- 
mental and control groups as measured by 
Fart II of the Home Garden Achievement Reten- 
tion-test. 


The Swine Production Experimental Froject 


The Swine Production Experimental Project 
consisted of four independent experiments, each 
having its own experimental or audio-visual aids 
group and control or non-audio-visual aids 
group. The groups in each independent experi- 
ment were taught by the same instructor. used 
the same reference materials, spent the same 
length of time on the swine unit. and were ad- 
ministered the same tests at the same time. 


In each of the independent experiments the ex- 
perimental and control groups were taught as a 
unit except for time necessary to show the 
movie ‘‘Pig Frojects Make Profits’’.to the ex- 
perimental group. During the time the movie 
was being shown the control group continued its 
study of the swine unit as directed by the local 
teacher of agriculture. 


The Swine Production Achievement Test was 
administered as pre-test and post-test in all 
independent experiments and as retention test 
in three of the independent experiments. 


It was statistically permissible to pool the 
data of the four independent experimental groups. 
This was also true of the four control groups. 
The Johnson-Neyman technique was used to test 
the significance of the difference between the 
post-test means of the pooled experimental 
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TABLE Il 


STATISTICS COMPUTED FOR THE APPLICATION OF THE JOHNSON-NEYMAN 
METHOD OF ANALYSS OF HYPOTHESS Hy 





Calculated Data 

1. Notations 
Dependent Variable 
Independent Variables 


Frequencies 


Z;: Part II Home Garden Achievement Post-test. 
X,: Part Il Home Garden Achievement Pre-test. 
Y;: Terman-McNemar Test of Mental Ability 

Ns 


where s = 1, 2 denote Experimental and Control Data, respectively. 


2. Results 
N, = 73 
I Z, = 1960 
< 2? = 63592 
XX, = 1695 
= x? = 47679 


© ¥, = 6527 


= ¥? = 619977 


= ZX; = 50024 
= ZY, = 183757 


x X,Y, 155555 


Mean Z, = 26.85 
S.D. 2, = 12.34 
Mean X, = 23.22 
S. D. Xy = 10.75 
Mean Y, = 89.41 


S. D. Y; = 12.02 


No = 73 
E Zo = 1826 
< 2g = 52260 
<X Xq = 1601 
© x§ = 40149 
< Y2 = 6599 
I y$ = 637364 
= Z_Xo = 42734 
© Ze¥q = 171674 
E X2¥2 = 145854 


Mean Zo = 25.01 
S.D. Zg= 9.56 
Mean Xo = 21.93 
S.D. Xg= 8.36 
Mean Y2 = 90.40 


S. D. Yo = 12.95 
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groups and the pooled control groups. Table III 
gives the basic statistics which were used in 
applying the Johnson-Neyman technique to the 
results of the Swine Production Achievement 
Post-test. 


The ratio between the relative and absolute 
minimum sums of squares in this case, or u, 
is .9954. By interpolation from Table. V in 
Johnson-Neyman's article? we obtain u.01 = 
.1936. Thus, since our observed value of u 9 
is greater than the interpolated value we con- 
clude that there is not a significant difference 
between the means of the experimental group 
and the control group. In other words the mean 
achievement of the audio-visual and the non- 
audio-visual aids groups as measured by the 
Swine Production Achievement Test was not 
significantly different. 


The results of the Swine Production Achieve- 
ment Retention-test were analyzed by the analy - 
sis of variance and covariance with the post- 
test results held constant. The difference be- 
tween the means of the experimental and control 
groups was not significant as measured by the 
retention test. 


The Pasture Production Experimental Project 





The Pasture Froduction Experimental F roject 
consisted of two independent experiments in 
which three sets of kodochrome pasture slides 
with descriptive notes were used as part of the 
instruction of the experimental group. The ex- 
perimental and control groups in each independ- 
ent experiment were taught as a unit except dur- 
ing the time when the audio-visual aids were 
shown. The control groups in each experiment 
were taught separately from the experimental 
group for a period of time equivalent to that 
used in showing the slides to the experimental 
groups. The instruction of both groups in each 
independent experiment was supervised by the 
same instructor. 


It was statistically permissible to pool the 
data for the two independent experimental 
groups and it was also possible to pool the 
data for the two control groups. Applying the 
t-test of significance to the pooled data for 
Part I of the Fasture Production Achievement 
Post-test results, it was found that the means 
of the experimental and control groups were 
not significantly different. This was likewise 
true of the results for Part II of the Fasture 
Production Achievement Post-test. 


Conclusions 


The conclusions resulting from the three ex- 
perimental projects in this investigation are 





as follows: 


1. The Home Garden Experimental Fro- 
ject. 


a. The mean of the experimental or 
audio-visual aids group was signif- 
icantly higher than the mean of the 
control or non-audio-visual aids group 
in informational learning as measur- 
ed by Part I of the Home Garden 
Achievement Post-test. 


b. The. region of significance indicated 
that the mean of the experimental 
group may be said to be significantly 
superior to the control group through- 
out the range of mental ability of the 
students and for the average or be- 
low average student as measured by 
Part I of the Home Garden Achieve- 
ment Pre-test. 


c. The mean of the experimental group 
was not significantly different from 
the mean of the control group in appli- 
cational learning as measured by Part 
II of the Home Garden Achievement 
Post-test. 


d. In the two independent experiments 
where retention was measured, the 
means of the experimental and control 
groups were not significantly different 
in either informational or application- 
al learning as measured by Part I and 
II of the Home Garden Achievement 
Retention-test, respectively. 


. The Swine Production Experimental 
Project. 


a. The mean of the experimental group 
was not significantly different from 
the mean of the control group in 
achievement as measured by the 
Swine Production Achievement Post- 
test. 


. The mean of the experimental group 
was not significantly different from 
the mean of the control group in 
achievement as measured by the 
Swine Production Achievement Re- 
tention-test. 


3. The Pasture Production Experimental 
Project. 


a. The mean of the experimental group 
was not significantly different from 
the mean of the control group in in- 





4. Johnson-Neyman, op. cit. 
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TABLE Ifl 


STATISTICS COMPUTED FOR THE APPLICATION OF THE JOHNSON-NEYMAN 
METHOD OF ANALYSIS TO HYPOTHESS Hm 





Calculated Data 


1. Notations 


Dependent Variable 
Independent Variables 


Z,: Swine Production Achievement Post-test. 
X,: Swine Production Achievement Pre-test. 
Ys: Terman-McNemar Test of Mental Ability. 


Frequencies Ns 
where s = 1, 2 denote Experimental and Control Data, respectively. 


2. Results 
N, = 37 N, = 42 

EZ, = 2117 LZ, = 2337 
= 2? = 126157 X 2g = 136455 
I X, = 1568 I Xq = 1819 
L x? = 71224 CL xg = 85053 
LY, = 3295 I Y, = 3936 
= y¥? = 310819 < 3 = 388898 


= ZX, = 92845 


194233 


) X,Y, = 145758 


Mean Z, = 57.22 
S.D. Z, = 11.82 
Mean X , = 42.38 
S. D. X, = 11.76 
Mean Y, = 89.03 


8. D. Y¥, = 11.50 


= XoYo = 176712 


Mean Z, = 55.67 
S. D. Zy = 12.51 
Mean Xp = 43.31 
S. D. Xq = 12.36 
Mean Y, = 93.71 
S.D. Y, = 11.73 
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formational learning as measured 
by Part I of the Fasture Froduction 
Achievement Fost-test. 


b. The mean of the experimental group 
was not significantly different from 
the mean of the control group in 
applicational learning as measured 
by Part II of the Fasture Production 
Achievement Post-test. 


Implications 


In conclusion we wish to summarize by indi- 
cating certain implications which appear to be 
warranted as a result of this study: 


1. Audio-visual aids which are developed 
for use in vocational agriculture classes 
would likely be more effective as aids 
to learning if they were designed to sup- 
plement rather than repeat the visual 
experiences the students have had. 





2. Audio-visual aids which are designed 


to teach new ideas or principles should 
be accompanied with adequate interpre- 
tation either by means of sound record- 
ing or by the instructor. 


. The catalog descriptions of audio-vis- 


ual aids were not dependable guides to 
follow when selecting audio-visual mater- 
ials which were effective aids to learning 
in classroom situations. 


. If we are to secure audio-visual aids 


that actually enhance learning, careful 
experimental studies should be made, 
testing their value before they are made 
available for instructional purposes. An 
analogy may be made here from the field 
of agriculture. Namely, no hybrid corn 
is recommended without a careful exper- 
imental measure of its yielding ability. 
Why shouldn’t the same critical attitude 
be exercised in the selection of audio- 
visual aids which are to be used in learn- 
ing situations? 
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NOTE ON ESTABLISHING NORMS IN A 
LARGE CITY SYSTEM 


J. A. KEATS 
Australian Council for Educational Research 


In his article in the March 1948 number of the 
journal of Experimental Education, H. L. Stein! 
suggested how the problem of establishing meas- 
ures of central tendency and variability for a 
school system could be met. The solution offer- 
ed for finding a measure of central tendency is 
of practical interest, although an indication of 
the extent of the bias caused by heterogeneity 
of variance in the schools would have been wel- 
come. However, the solution given for an un- 
biased estimate of population variance is prob- 
ably not unbiased. The formula quoted from 
Hoel gives an unbiased estimate of the within 
schools variance, and since this variance has 
been shown by Stein to be heterogeneous, i.e. 
varying by an amount greater than would be 
expected by chance, there is little point in es- 
timating it. A more exact procedure for ob- 
taining an unbiased estimate of the population 
variance is given below; but all such proced- 
ures must ignore the heterogeneity of school 
variances. 








Since the derivation of the formulae neces- 
sary for this procedure is not easy, it has been 
included in the appendix. However, the ideas 
behind the mathematics are fairly straight- 





more than can be attributed to chance and in 
general will have a smaller range. Further- 
more, the degree of homogeneity will differ 
from school to school, and we must expect 
differences in standard deviations which are 
greater than chance, as has been demonstrated 
by Stein. 


The ideas behind the formulae involve the 
following steps: 


1. To test whether or not the means of 
schools differ from each other by more 
than can be expected by chance fluctu- 
ation. (Analysis of Variance--test the 
between schools against the within 
schools variance.) 


2. If a positive result is obtained in 1, 
above, then to estimate the amount by 
which the pooled within schools vari- 
ance has been reduced by the selective 
factors mentioned above and hence ob- 
tain an unbiased estimate of the popu- 
lation variance. 


Re-Analysis of the Data Quoted by Stein 





Analysis of Variance 





S.of S. Ds. of F. Variance F 

Between Schools 65,620.00 45 1,458.2222 7.2 
Within Schools 426,013.222 2095 203.3476 (6 = 14.26) 
Total 491,013.22 2140 229.7351 (o = 15.16) 


forward, and if they are acceptable the alge- 
bra can be verified at leisure. In most sur- 
veys of the type quoted it is found that there 
is a highly significant difference between the 
means of some of the schools tested. This is 
not surprising as it is obvious that a school 
will in the main serve the district in which it 
is situated and that this district will contain 
predominantly members within a limited 
socio-economic range; it is to be expected 
then that the scores of children of sucha 
school in any given test will often have a mean 
which will differ from the population mean by 





Stein has shown that the conditions under- 
lying the F test are not fulfilled, since the 
schools cannot be regarded as homogeneous 
with respect to variability. However, an em- 
pirical study by Lindquist2 (Chapter V, Sec- 
tions 8 and 9) suggests that when the effect 
of this heterogeneity is taken into account an 
F45,2095 = 7.2 would almost certainly be sig- 
nificant at the .1% point. 


We now proceed to estimate the amount the 
variance of the population has been reduced 
by the selective factors operating. 
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Let Vj represent the component of var- Again from the formulae derived in the appen- 
iance within schools and Vps the com- dix 
ponent of variance between schools. r 
ne Vx =Vi+ Vbs xE ng Formula A 

n = -M;)(r-1 nr S= 
Ybs (Mos i )( ) n — 
(m- £ ng) 
s=1 when Vx is the variance of the estimate 
n of the population mean. 


where Mj, is the variance within schools 


Mpg is the variance estimated 
from the means of schools 


n is the number of children 


ng _ is the number of children 
in the sth school 


r is the number of schools 


Vis = 1,254.8746 x 45 
101,833 


(2141 - 2141 ) 
= 47.6 
= 56469.357 
2093.4 
= 26.975 








and the best estimate of the population 
variance is 


M, = Vj = 203.3476 
Vps+ =_ 26.975 
230.3226 


oO = 15.18 which is only slightly 





greater than the estimate obtained from 
the whole distribution, but somewhat 
greater than the 14.26 obtained from the 
within schools. 


The Standard Error of the Estimate of 
the Population Mean 


From the estimates of the two components, 
i.e. within and between schools, it is a short 
step to estimate the standard error of the esti- 
mate of the population mean, It is assumed 
here that the 46 schools tested are a random 
selection from an infinite (or at least very large 
c. f. 46) number of schools associated with the 
City of Winnipeg, with a similar assumption 
with regard to the children tested in a given 
school. (It may be argued that the 2,141 cases 
are the total population in the particular age 
group, in which case the population variance is 
that of the 2,141 cases, and the population mean 
is known exactly, i.e. has no standard error.) 











= 203.3476 , 26.975 x 47.6 
2if1 2141 

= .0950 + .6997 

= .1947 


i.e. the standard error of the population 
mean obtained from the total distribution 
is .891. and since this estimate is 98.03 
the weighted mean obtained by Stein, 99.38, 
does not differ significantly from it. How- 
ever, 99.38 is the better estimate to use. 


The technique used in this solution has been 
applied to data obtained by the Australian Council 
for Educational Research for the purpose of com- 
paring the performance of students from the six 
states of Australia. The scores of 35,000 child- 
ren from 600 schools have been examined and 
the standard errors of estimated State means 
(calculated as above) used to test the significance 
of the differences obtained. The possibility of 
using similar methods in factor studies based on 
heterogeneous populations is at present being 
examined. 


The writer is indebted to E. J. Williams for 
suggesting the method of approach used in this 
article. 
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Appendix 
Derivation of Formulae: 


Let xjs represent the score of the ith person 
in the sth school. 
ng the number of children in the sth school. 
r the number of schools. 
r 
Then the total number of children n= £ ng 
s=1 





(Ma! 


Let 


wee 
squa 
meal 
the e 
being 


Then 
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Ng 
the mean of the sth school %, =C xj, 
i=1 
Ns 
the mean for the sample (estimate of the 
population mean) 


Vj and Vps represent two components of the 
population variance -- Vj due to differences 
between pupils within a school and Vp, due 
to differences between schools caused by 
selective factors which vary from school to 
school. Then since these two factors are 
independent the population variance is equal 
to Vj + Vbs- Then since these two factors 
are independent the population variance is 
equalto Vi +Vp,- Then Vx, =Vi + Vbs 


— 


Le 





—— . 

Ve= f nove = 5.97 eV 

x= 21% Vag ai *™ 
n? n2 


- 
= Vi +Vbs +25 (Formula A) 
n n# 








To obtain formula for estimating Vi and Vbs 
we examine the expected values of the sums of 
squares within schools and between school 
means and equate these two expected values to 
the estimates of them obtained from the sample 
being examined. 


Then E(si) - the expected value of the pooled 
sum of squares within schools 


r 


-2 





21 (83 ~ 1) Vi=(n-r) Vi 
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i.e. Vy = E(sj) and hence an unbiased estimate 
aS 
of Vy is the pooled within schools variance, 


the value used, erroneously I believe, by 
Stein for the population variance, and 





E(Sps) = E, ng Vii) - 9 Vz 
= £ (Vj + Ms Vig) - Vi - Vos Eng 
s=1 s=1 


= Yj (r - 1) + Vg (n -Eng) 








n 
. V = E(S )-V (r- 1) 
s =: Ea) 
n 


Let Mj represent the estimate of variance ob-. 
tained by pooling the within schools 
variance. 


Mps the estimate of variance obtained from 
the variance of the school means. 








then Wi = Mj 
Ybs = — ~a )(r - 1) (Formula B) 
n 


If all the ng are equal (i.e. same number of 
children in each school) and equal fig say, 
then Ng Vbs = Mps ~ Mj 
and Vx= Mps =Mps_ (a formula recommend- 

r.Ng n 
ed by E. F. Lindquist? but which should be used 
only when the condition of equal numbers in 
schools is satisfied.) 
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A GRAPHICAL REPRESENTATION OF 
TEST RELIABILITY 


JOHN A. JARVIS 


Stout Institute 


The Kuder-Richardson short formula* pro- 
vides an easy means of graphically represent- 
ing the relationship between the reliability of a 
test and the standard deviation of that test. This 
formula appears below: 


2 
e O- npg 


rtt n 
o* 


n-1 
where: 


Tet reliability of the test 


n number of items 

0 the standard deviation 
p the mean divided by n 
q ee 


The variables p and q may be eliminated from 
the original formula in this manner: 

let x npq 
but q l-p 
then x np (1 - p) 
x = np- np2 


Taking the first derivative of X with respect 
to p 





dx = n-2np 
dp 
Since npq is to be a maximum, let 


dx = 0 
dp 

Then 
0 = n-2np 
2np= n 
p = 24or 5 
q = 1-5 
q = O 


The expression npq will then be a maximum 
when p and q are equal to .5. Substituting this 
value for p and q we have 





rit =_D_ - on (.5)(.5) 
n-l a? 
==. o*- -25n_ 
n-1 a2 


Using the above formula it is possible to con- 
struct the family of curves included as Fig- 
ure 1, This graph then provides a means of 
visualizing the increase in reliability with an 
increase in variability and a rough check on 
test reliability when computed by other means. 





> fo 


G. F. Kuder and M, W. Richardson, 


liability," Psychometrika, II (September 
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"The Theory of the Estimation of Test Re- 


1937), pp. 151-160. 
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